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@ Biosynthetic process for the preparation of proteins. 

© A bacterial host is described which is transformed by a plasmid coding for a polypeptide precursor wherein 
the host comprises a multi- enzyme complex capable of reacting with the expressed polypeptide precursor to 
produce a polypeptide comprising at least one dehydroamino acid and/or at least one lanthione bridge. A 
process for producing a polypeptide comprising at least one dehydroamino acid and/or at least one lanthione 
bridge, such as gallidermin, is also described. A plasmid capable of transforming a bacterial host is additionally 
described. 

Also disclosed are recombinant DNA molecules which specify Epi B, Epi C, Epi D, Epi P and Epi Q, 
enzymes which are involved in the biosynthesis of lantibiotic epidermin. 
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Cross - Reference to Related Application 

The present application is a continuation-in-part of U.S. Application Serial No 07/353,590, the 
disclosure of which is fully incorporated by reference herein. 

5 

Field of the Invention 

This invention relates to the biosynthesis of chemical compounds, and in particular to the biosynthesis 
of chemical compounds containing dehydroamino acid residues and/or thioether bridges. The invention also 
70 relates to the use of recombinant genetics to prepare enzymes involved in the biosynthesis of such 
chemical compounds. 

Background of the Invention 

Some polypeptide antibodies such as nisin, subtilin, duramycin, cinnamycin, ancovenin, Ro 09-0198 
and epidermin contain dehydroamino acids and lanthionine bridges. These polypeptides are produced by 
various respective strains of microorganism. Nisin for example can be produced by cultivating strains of 
Streptococcus lactin, and subtilin by cultivation of Bacillus subtUis. 

The genetic basis for the biosynthesis of these antibiotics has not, hitherto, been elucidated. Thus, it 
has not been known, for example, whether biosynthesis of such antibiotics and, in particular, the formation 
of the unusual amino acids found therein occurs via ribosomal synthesis or via multi- enzyme complexes. 

It addition it was not know whether the precursor proteins of such antibiotics were coded by distinct 
structural genes or were the degradation products of larger proteins. 

In the course of work carried out to establish the structural gene of epiderm, we have been able to 
establish that surprisingly the above mentioned antibiotics, in particular epidermin, are each coded by a 
distinct structural gene, and that processing of a presequence polypeptide is carried out by an enzymatic 
complex which effects formation of dehydroamino residues and/or thioether bridges. 

Furthermore, the multi -enzyme complex may be involved in the secretion of the protein through the 
cell membrane into the culture supernatant, as well as processing a prepolypeptide. In this connection, such 
activity may be associated with a pre -sequence possessed by the pre - polypeptide, e.g.. as in the case of 
the -30 to - 1 sequence of pre -epidermin as described below. 

Description of the Drawings 

Figure 1 depicts the nucleotide sequence of the epidermin structural gene (epi A) and the deduced 
amino acid sequence of pre - epidermin. A Shine - Dalgarno sequence is boxed and the proteolytic 
cleavage site at which the propeptide is processed is indicated by an arrow. Inverted repeats are underlined 
and potential stop codons are noted as am (amber) and oc (ochre). 

Figure 2A depicts a prediction plot for pre -epidermin using a Hyron program, in which the respective 
40 bar charts show: (a) flexibility; (b) hydropathy; (c) hydrophilicity; (d) propensities for turn; (e) 0- sheet; and 
(f) a - helix conformation. 

Figure 2B depicts a helix wheel plot for pre -epidermin showing that the N- terminus may partially 
adopt an amphophilic a - helical conformation in an appropriate environment. 

Figure 3 depicts a postulated naturation procedure for epidermin. The translated polypeptide (pre - 
45 epidermin) consists of 52 -amino acid residues. Structure predictions indicate a partially a -helical N- 
terminus from which residues -30 to -10 may form an amphophilic a -helix conformation. Water 
elimination occurs at the indicated Ser and Thr residues (a). With the exception of Thr +1 *, water elimination 
is followed by sulphide ring formation (b) and at the C- terminus, decarboxylation (c) and double bond 
formation (d) to produce pro -epidermin. The pro- epidermin structure is then processed by proteolytic 
so cleavage to produce epidermin. 

Figure 4 depicts the structure of epidermin. The ring structures are designated as A, B, C, D and E. The 
structures of the amino acids mesolanthione and threo - methyllanthione, are set forth. 

Figure 5 depicts examples of unusual amino acids which are found in lanthione antibiotics and which 
can be formed in peptide products using the method of this invention. 
55 Figure 6 depicts a schematic representation of the process for preparation of the pCUl plasmid from 

pCLPlOO plasmid and pUCl8 plasmid. 

Figure 7 depicts the elution pattern of the isolated culture medium prepared in Example 2. 
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Figure 7B depicts the elution pattern of a standard containing gallidermin. Gallidermin is eluted at 7.54 
minutes. 

Figure 8 is a genetic analysts of episome pTU32 of S. epidermis plasmid pTu32, including 
8A: a restriction map of episome pTQ32, anc 
5 8B: a restriction map of the 13.5 kb Bglll fragment of pTU32. The filled arrow corresponds to the epiA 

structural gene. Open arrows represent reading frames epiB, C, D, P and Q. 
8C: Southern hybridization of pT032 digested with different restriction enzymes (EcoRI, EcoRV, 
Bgllll, Sphl) using a 1 5 - mer oligonucleotide (5'CACATCCAGGAGTAC - 3') specific of epiA. 
Figure 9 is a nucleotide sequence of the Bglll/Hpall fragment of pTU32 containing reading frames epiA, 
w B, C, D, P, Q, Y\ and Y" and the deduced ammo acid sequences of the respective proteins. S/D sequences 
and termination structures are overlined. IR indicates inverted repeats. The start of the open reading frames 
of epiY, epiA, epiB, epiC, epiD, epiQ and epiP are indicated by bold letters. The N -terminal amino acid 
residues (possible translational start sites) are boxed. 

Figure 10 shows the results of a Northern blot analysis of epiA (10A) and epiB (10B) expression in S. 
75 epidermidis, in which total RNA (40ug, lanes 1, 3. and 5, or 20 ug, lanes 2, 4, and 6) were separated on 
1 .2% agarose gels and hybridization was performed with an antisense RNA probe (SP6 transcript. Filters 
were washed with increasing stringency; lanes 1, 2:1xSSC, 0.1% SDS, exposition time, 4 h, lanes 3, 
4:0.5xSSC, 0.1% SDS, exposition time 16 h; lanes 5,6:0. 1x SSC, 0.1% SDS, exposition time, 3 days). The 
positions of 235 and 165 RNAs were used as a size standards. 
20 Figure 11 shows sequence homologies between EpiP and different serine proteases at the active sites 

(SUBSI, subtilisin 1168 precursor of B. subtilis (Terzaghi et a/., Appi Microbiol. 29: 807-813 (1975); major 
intracellular serine protease from B. subtilis (Maniatis et al.. Molecular Cloning. A Laboratory Manual: 2nd 
ed. Cold Spring Harbour Laboratory Press (1990); SUMYTV, thermitase from Thermoactinomyces vulgaris 
(Stahl et al., J. Bacterid 158: 411 -418 (1984)). The strongly conserved asparagine (asp), histidine (his). 
25 and serine (ser) residues are marked by asterisks. Similar amino acid residues are indicated by points and 
identical amino acid residues by colons. 

Figure 12 shows sequence homologies between epiQ and PhoB (Makino ef al., J. Mol. Biol. 790:37-44 
(1986)). Similar amino acid residues are indicated by points and identical amino acid residues by colons. 
Figure 13 is a HPLC elution profile of epidermin which was produced in S. camosus TM300. 
30 13A: Elution profile of epidermin standard substance (6.75 min, indicated by an arrow). 

13B: Elution profile of epidermin standard substance (6.75 min, indicated by an arrow) isolated from 
culture filtrates of S.carnosus TM300 pTepi14. Culture filtrates were adsorbed to XAD 1180, 
eluted with methanol and finally concentrated by evaporation. 
13C: Elution profile of untransformed S. camosus TM300 culture filtrate treated as in 13B. The solid 
35 line indicates the elution region of epidermin. 

Figure 14 shows the construction of pT181mcs. The Pvull 309 -Pvull 631 fragment of pUCl9, part of lacZ 
and the multiple cloning site (mcs), was inserted into the single Ndel site within pre of pT181 (Gennaro et 
a/., J. Bacteriol. 769:2601 -2610 (1987); Kahn et a/., Plasmid 70:251 -259 (1983)) by blunt-end ligation. 
lacZ is in the opposite orientation to a pre. Black bar, interrupted pre; open bar, inserted pUC19 fragment. 
40 Figure 15 shows the construction of pCU1. PCLP100 is a derivative of pCl94 (Horinouchi ef a/., J. 

Bacteriol. 750:815-825 (1982)) containing a single Pst1 site which was generated by opening pC194 at 
the Hindlll site, deleting the ends with Bal31 (approximately 950 bp) and inserting a Pstl - linker by blunt - 
end ligation. PCU1 was then generated by blunt -end ligation of pCPUOO and pUC19 (Vieira et al., Gene 
79:259-268 (1982)) via the single Pstl and Ndel sites, respectively. The multiple cloning site (mcs) in front 
45 of lacZ was used for cloning various epi gene - containing fragments. This shuttle vector replicates both in 
staphylococci and E. coli. 
Figure 16 shows: 

A) the generation of pTepi14 by cloning the 14 kb Bglll fragment of pTU32 in pT181 mcs. This fragment 
containing the entire genetic information necessary for epidermin production in S. camosus. The 

50 indicated ORFs and their transcriptional directions (indicated by arrows) are deduced from the DNA 
sequence. epiA, the structural gene, is presented by the black arrow. 

B) various pTepiM DNA fragments subcloned into pT181mcs (pT...) or pCU1 (pCU...). The respective 
plasmids were used to complement the S. epidermidis Epi mutants. The complete ORFs represent in 
the plasmid are indicated. 

55 Figure 17 shows the construction of pPS4epiA and pPS4epiB. pPS4 is a derivative of pLipPSI (Liebl et 

al., Mol. Gen. Genet. 204:166-173 (1986)). A single BamHI site was inserted after a strong staphylococcal 
promoter. Cloning of genes into the BamHI site under the control of the ORF2 promoter normally leads to 
good expression in staphylococci. epiA was PCR - amplified and contained flanking BamHI sites. The 3.2 kb 
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BstNl fragment containing epiB was inserted into the BamHI site by blunt -end ligation. The respective 
EMS - mutants were complemented only when epiA and epiB were under the control of the ORF2 promoter. 
lip, lipase gene; cat, chloramphenicol acetyl transferase gene; ORF2, S. camosus - specific truncated ORF. 
Figure 18 shows the complementation of epidermin production in S. camosus (pTepiABCDQ) by 
5 flanking DNA fragments. The fragments were subcloned into the compatible plasmid pCA44. 

Description of the Preferred Embodiments 



Broadly speaking the present invention provides in one aspect a bacterial host containing a plasmid, 
70 wherein said plasmid codes for a polypeptide which is not normally produced by said host, and wherein 
said host during cultivation provides a multi - enzyme complex whereby a polypeptide is produced which 
contains at least one dehydroamino acid and/or at least one lanthionine bridge, said produced polypeptide 
being foreign to said host. 

A suitable multi enzyme complex is one which is capable of effecting at least one of the following 
75 operations, namely water elimination and sulphide bridge formation; the complex may also effect decar- 
boxylation and double bond formation. 

Suitable hosts for carrying out the process of the present invention are those which, without modification 
of their genetic material, are capable of producing polypeptides containing a dehydroamino acid residue 
and/or lanthionine bridge and/or a methyl lanthionine bridge. Examples of such hosts are Streptococcus 
20 lactis, Bacillus subtilis, Streptomyces cinnamoneus, Streptomyces sp. Streptoverticullum griseover- 
ticillum, Staphylococcus epidermis, Staphylococcus epidermin strain 5, Staphylococcus gallinarum and 
mutant strains thereof, e.g., a mutant strain of S. epidermin DSM 3095 which is incapable of producing 
epidermin. 

Strains which are of special interest are Staphylococcus gallinarum (F16/P57) Tu 3928 which has been 
25 deposited with the Deutsche Sammlung von Microorganismen under the terms of the Budapest Treaty on 
18 May 1988 and has received the depository number TO 3928 in DSM 4616 and Staphylococcus 
epidermis DSM 3095 which was deposited by the present applicants with the Deutsche Sammlung von 
Microorganismen under the terms of the Budapest Treaty on 26th October 1984. 

In order to transform a suitable host, a suitable plasmid may be modified by known genetic engineering 
30 techniques. 

Desirably a plasmid from a host which produces a polypeptide containing at least one dehydroamino 
acid residue and/or at least one sulfide bridge is treated by modifying or replacing the gene coding for a 
pre - polypeptide to provide a plasmid coding for a polypeptide foreign to said host and then transforming 
said host with the altered plasmid. 
35 Any of a variety of methods may be used to replace or modify a gene coding for the pre - polypeptide. 

DNA coding for the pre - polypeptide sequence of the desired compound can be prepared by chemical 
synthesis. Suitable chemical syntheses have been disclosed in Anal. Biochem. 121, 365 (1982). The known 
techniques allow the preparation of polynucleotides, e.g., of up to 60 to 100 bases to be prepared. 

Suitable protected nucleotides can be linked by the phosophotriester method Agarwal et a/., (Agnew, 
40 Chem. 84, 489 (1972)), the phosphotriester method (Reesem., Tetrahedron 39, 3, (1983)) or the 
phosphitetriester method (Letsinger et ai. J. Am. Chem. Soc. 98, 3655 (1976)) or the phosphoramidite 
method. The solid phase method allows for simplification of the synthesis of the polynucleotides. 

The double stranded DNA can be constructed enzymatically from chemically prepared short but 
overlapping segments. 

45 For example, overlapping polynucleotide sequences from both DNA strands can be used, which are 
held together in the correct conformation by base pairing and are then chemically linked by the enzyme 
DNA hgase (Khorana et a/., J. Biol. Chem. 251, 565 (1976)). 

Another possibility comprises incubating in each case one polynucleotide sequence from the two DNA 
stands with a short overlapping segment in the presence of the four required deoxynucleoside triphosphates 
60 with a DNA -polymerase, for example, DNA - polymerase I, the Klenow fragment of polymerase I or T4 
DNA -polymerase, or with reverse transcriptase. The two polynucleotide sequences are thereby held 
together in the correct arrangement by base pairing and are supplemented with the required nucleotides by 
the enzyme to give a complete double - strand DNA (Narany et al. Anal. Biochem. 121, 365 (1982)). 

Another suitable method for obtaining the DNA coding for a polypeptide comprises isolating the DNA 
55 from the genomic DNA of a tissue or cell culture or microorganism, lysing the cells e.g. with SDS or 
proteinase K, or if desired mechanically, and deproteinising the DNA by repeated extraction with phenol. 

The RNA can be preferably digested with RNase. The obtained raw DNA is partially digested with 
suitable restriction enzymes e.g. Haelll and A1ul and fragments isolated and multiplied in a suitable phage 
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or cosmid, e.g. in charon 4A or EMBL-3 phage and assayed for the desired sequences e.g. with a 
radioactively labelled DNA probe. 

The DNA coding for a desired polypeptide can also be obtained by reverse transcription of isolated 
mRNA into cDNA. This may be the preferred method if the DN/ structure is not known. In this method the 

5 DNA is obtained from genomic DNA in a cDNA library via the mRNA. The cDNA library comprises the 
genetic information which is complementary to the mRNA isolated from cells. 

To obtain a cDNA library, the mRNA is isolated from cells expressing the desired basic (possibly 
unmodified) protein. This mRNA is converted into double stranded cDNA. 

Standard methods well known in the art are applied in the preparation of mRNA. The cell membrane is 

70 broken and the cell content released from which the mRNA is isolated. The cell membrane is preferably 
broken by physical methods or lysis with detergents such as SDS, guanidine thiocyanate, definite salt 
conditions or homogenization, preferably by mixing. The mRNA is isolated by the standard methods of 
phenol extraction, ethanol precipitation, centrifugation and chromatography, preferably a combination of 
several methods. Centrifugation is preferably done over gradients, for example over a CsCI gradient. For 

75 chromatography, preferably columns are used, especially oligo-dT columns. 

The total mRNA can be converted directly into Ds-cDNA following the methods of the art. Preferably 
the mRNA coding for a desired polypeptide is further enriched using several techniques, such as 
electrophoresis, chromatography and centrifugation, preferably sucrose gradient centrifugation. 

Fractions containing mRNA coding for a desired polypeptide can be detected by various methods, such 

20 as in vivo or in vitro translations, followed by detection of a relevant activity or, when the nucleotide 
sequence is known, by hybridization with an oligonucleotide probe 

In vivo translation systems can be prokaryotic or eukaryotic systems. A preferred in vivo translation 
system is the Xenopus laevis oocyte system (see Maniatis et a/., Molecular Cloning A Laboratory Manual 
Cold Spring Harbor Laboratory (1982)). In vitro systems are, for example, wheat germ and rabbit 

25 reticulocyte lysates, both of which are commercially available. 

From any pool of mRNA derived from unfractionated or fractionated mRNA, ds-cDNA can be obtained 
by the well known methods of the art (preferred general methods are described in Maniatis et al. (supra), 
Okayam and Berg, Molecular and Cell Biology 2, 161 -170 (1982) and Heidecker, Nucleic Acid Research 
1 /, 4891 -4906 (1983)). In general, the mRNA is converted first to ss-cDNA using reverse transcriptase or 

30 DNA -polymerase I (Klenow fragment). Two methods are alternatively used for priming the synthesis of the 
ds-cDNA. The first method was the natural loop formation of the ss-cDNA. The second method is that of 
tailing the ss-cDNA with a homopolymeric tail such as poly-dC or poly-DT. 

The mRNA fraction of which the corresponding polypeptide shows the highest activity in the detection 
system is transcribed into the complementary cDNA by methods well known in the art. The mRNA and 

35 oligo-dT as a primer are mixed, dNTPs are then added as starting material and the synthesis of the 
cDNA-mRNA hybrid molecule is realized by the enzyme reverse transcriptase. The RNA molecules are 
degraded by addition of NaOH. DNA polymerase is admixed, preferably the Klenow fragment of the DNA 
polymerase I, and the mixture is incubated at a suitable temperature, preferably 12-15'C. The mixture is 
incubated with nuclease S1 and the ds-cDNA corresponding to the mRNA coding for a desired polypep- 

40 tide is obtained. 

For amplification the obtained ds-cDNA can be spliced into suitable vector e.g. the plasmid pUC-KO 
and the obtained hybrid vector multiplied by use of a suitable host, e.g. E. Coli HB101. Reisolation of the 
hybrid vectors, and recovering the isolated cDNA therefrom allows a structure determination of the DNA 
coding for a desired polypeptide. 

45 

Preparation of a Hybrid Vector 

A hybrid vector of the invention can be prepare by splicing a DNA coding tor a polypeptide of the 
desired sequence into a suitable vector. 
50 Suitable vectors are carriers for integrated passenger DNA, which can be used to transform a host 

microorganism. 

Suitable as vectors are plasmids derived from microorganisms which in an untransformed state produce 
polypeptides which contain dehydroamino and/or sulfide groups. Suitable vectors carry the insert DNA at a 
defined position. 

55 In general, such vectors may contain a replicon and a control sequence, i.e. a promoter, which are 

derived from the host cell or a species compatible with the host cell in which they are used. The vector 
ordinarily carries a replicon site and may contain sequences (marker genes) which are capable of providing 
phenotype selection in transformed cells. Suitable marker genes may provide antibiotic resistance or 
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resistance to heavy metals or they may complement a genetic defect of the host. Further useful sequences 
in such vectors are enhancer and activator sequences. 

One suitable starting vector is a 54 Kbp plasmid pEpi32 from the strain Staphylococcus epidermis 
DSM 3095. This plasmid, which is characterized below, contains the ep/A gene encoding for a 52- 
5 propeptide, which is processed to a tetracyclic 21 - peptide amide antibiotic. A vector carrying a passenger 
DNA is designated a hybrid vector. 

The desired DNA is spliced into the starting vector by conventional methods. 

A starting plasmid for example can first be linearised by a suitable restriction enzymes, e.g. the plasmid 
pEpi32 by Hindlll, BamHI and EcoRI, then d/G - tailed in the presence of dGTP and the terminal 
10 deoxynucleotidyl transferase. The double stranded cDNA insert is dC- tailed in the presence of dCTP and 
terminal deoxynucleotidyl transferase. Combining both cDNA and vector results in the hybrid vector. 
Bacteriophages, such as lambda, are preferred for constructing genomic libraries. The lambda cloning 
systems are described by Maniatis (supra). The suitable vector DNA is digested to completion with the 
appropriate restriction enzyme, and the left and right arms are separated from the central fragments by 
75 velocity gradient centrifugation or gel electrophoresis. Another method is to digest parts of the stuffer 
fragments with restriction enzymes which lack recognition sites in the left and right arms. The isolated 
genomic DNA can be partially digested to fragments of 13-20kb in length. Afterwards the arms are ligated 
with the fragments of foreign DNA having termini compatible with those of the arms. 

The appropriate DNA insert is recloned from the original vector used for the original cloning, into a 
20 suitable expression vector. To this end appropriate restriction enzymes are used, possibly in combination 
with oxonucleones, to produce the desired DNA fragments. 

The DNA insert may be subcloned into a multiple site of a suitable well known plasmid vector e.g. 
derivatives of pC194, pTI81 and pUBHO at the restriction sites Hindlll'BamHI/EcoRI. 

The method of the invention can thus be used to prepare derivatives of known peptides and hormones. 
25 in which a cystein residue in the unmodified peptide is replaced by sulfide - bridged amino acids and serine 
and thiamine are replaced by corresponding dehydroamino acid residues. 

These fragments are integrated into an appropriate expression vector by using the cohesive ends 
directly or by the addition of appropriate chemically synthesized oligonucleotide bridges. For the modifica- 
tion of the ends for example Hindlll and BgLII can be used. The method is not limited to any special 
30 restriction enzymes. Any desired link can be made between the expression vector and the DNA insert using 
suitable restriction enzymes in combination with chemically synthesized oligonucleotides. 

Appropriate DNA inserts can also be obtained which code for polypeptide having site directed 
mutagenesis. 

A variety of methods may be used to induce mutations of underlying DNA so as to prepare the desired 
35 mutants. 

One method may comprise first inserting a fragment of a native or basic gene, containing sequences 
coding for the region to be mutated, into the replicative form of a phage, e.g. phage MI3mp8 to form 
MI3mp8PA. A synthetic oligonucleotide, complementary to the inserted sequences but containing one or 
more nucleotidetriplets which code for the amino acid to be substituted, is then annealed to the single 
40 stranded form of MI3mp8A to form a double stranded region. This region serves as a primer for DNA 
polymerase I synthesis of the remaining complementary strand. After replication and identification, the 
mutant sequence may be further modified or used to construct a suitable vector for expressing the mutated 
polypeptide. 

In the work carried out on epidermin a wobbled DNA probe 5' - GTG(A)CAT(G/A)ATG(A)AAT(C)TT - 3' 
45 deduced from a suitable pentapeptide segment of the proposed pre -sequence of epidermin 
LysPhelleCylThr was prepared. This DNA probe was hybridized against plasmid DNA from S. epidermin 
DSM 3095. 

Restriction analysis of the isolated plasmid reveals seven DNA fragments with EcoRI (16, 11. 10, 6.5, 
5.5., 3.5 and 2.5 kbp), nine DNA fragments with Hindlll (17, 14, 10, 5.3, 2.8, 1.8, 0.8, 0.6 and 0.5 kbp) and 
so five DNA fragments with BamHI (20, 19, 10. 3 and 1 kbp). 

A 5.4 kbp Hindlll fragment was subcloned and subjected to rehybridization whereby the structure gene 
epiA was located within a 2.2 kbp EcoRI/Bglll fragment. 

As a mixture of 24 different 14-mers was used as a hybridization probe The probe was applied in a 
30 -fold excess as a sequencing primer in accordance with the techniques described in Novick et a/. Ann. 
55 N.Y. Acad. Sci. 182, 279-294 (1971), Southern, J. Molec. Biol. 98, 503-517 (1975) and Heinrich et a/., 
Molecul. gen. Genet. 209, 563-569 (1987). The peptide sequence of epidermin allowed identification of 
the open reading frame. A single methionine codon is in appropriate distance to a Shme - Dalgaro 
sequence. The structural gene of pre - epidermin terminates at the TAA stop codon, hence pre - epidermin 
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consists of 52 amino acids (Figure 1) and it is processed to the epidermin between Arg~ 1 and lie' 1 . Thus, 
as can clearly be seen, pre - epidermin is not a degradation product of a larger protein but is coded by a 
distinct structural gene. 

Thus, it is apparent that, unexpectedly, the precursor protein of the antibiotics are coded by distinct 
5 structural genes. 

A combination of prediction profiles for secondary structure (a, 0, turns), flexibility, hydropathy, 
hydrophilicity (Figure 2A) and helix wheel plot were made using a Hycon program (Figure 2B). A high a- 
helix probability is predicted for pre - epidermin -30 to -8 whereas the C- terminal part 1-22 which 
corresponds to pro - epidermin exhibits very high turn probability. Moreover, the prediction plot shows 
10 clearly, that the N- terminus -30 to - 1 highly hydrophilic, whereas the C- terminal part is more lipophilic. 
The N - terminal part - 30 to - 8 seems to fold partially into an amphophilic a - helix. 

The N - terminal segment of pre -epidermin -30 to -1 does not contain any cysteine residues, 
whereas the C- terminal segment 1 -22 contains the four cysteine residues, involved in sulphide bridge 
formation. Sequence - 30 to - 1 included many cleavage sites for endoproteases whereas even in the 
75 pre -epidermin state, sequence 1 -22 is highly resistent to proteolytic degradation. 

The mature antibiotic can only be attacked by trypsin at Lys in position 13. The processing site Arg~ 1 - 
He* 1 is hydrophilic and accessible, due to the turn forming Pro -2 residue. 

The various enzymatic reactions which occur in the production of the antibiotics such as epidermin 
include modifications of the propolypeptide part 1-22; cleavage of the N -terminal propeptide fragment 
20 -30 to - 1 and secretion of the matured antibiotic (see Figures 3 and 4). 

The enzymatic modifications occur before cleavage of the prepeptide fragment. Enzymatic modification 
includes the elimination of water from Ser and Thr residues in position 5, 16, 19 and 8, 14 respectively to 
form dehydroalanine and dehydrobutyrine residues. Addition of thiol groups of Cys residues in position 2, 
11, 21 and 22 to the C = C double bonds, also occurs, yielding the meso - lanthionine or (2S 3S, 6R)-3- 
25 methyl - lanthionine bridges. In addition, decarboxylation of residue 22 and double bond formation yields the 
C- terminal S- (2 -aminovinyl) - D -cysteine. The reaction of C -terminally situated cysteine thiol groups 
with N -terminally located dehydroamino acids occurs with complete stereo - specificity in epidermin, nisin 
and subtilin. Accordingly, during modification these elimination - addition reaction imply a reversal of 
configuration of the Ca carbon atoms at pre -epidermin residues L-Ser and L-Thr to give D- configured 
30 Ca atoms. On the other hand, the L -configuration of the cysteine halves is still maintained. 

The four sulphide rings are also formed, subsequently at the same catalytic site, which is supported by 
the interaction with the N - terminal amphophilic a - helix. Only Thr +14 dehydrates without finding a cysteine. 
This position (Lys +13 - Dhb +U ) constitutes the enzymatic cleavage site at which trypsin inactivates the 
antibiotic epidermin. During sulphide ring formation C- terminal rigidity and hydrophobicity increases and 
35 may favor interaction of pro- epidermin with the lipid bilayer and may induce translocation. 

Finally, the hydrophilic a -helical N - terminus -30 to -1 is cleaved by a specific protease at the 
characteristic cleavage site described above. 

Using the techniques described above plasmids coding for lantibiotics can be modified either by 
mutation of the gene coding for the respective polypeptide or by replacement of such a gene by a gene 
40 coding for a different polypeptide and used to transform the original host or a different host, provided such 
host also, in its native state, is capable of expressing a lantibiotic. 

Generally speaking, where the original functional gene codes for a pre - sequence, as discussed above 
for example in the case of epidermin, the DNA sequence coding for such a pre -sequence may be retained 
in the modified plasmid; in this case the DNA -sequence for the new, or mutated pro - polypeptide will be 
45 positioned directly upstream of the pre -sequence DNA similarly to the original pro - polypeptide sequence. 
Cultivation of a bacterial host according to the present invention may be carried out under convention - 
ally used cultivation conditions as described for instance in our co - pending British Patent Application No. 
8811760.1 which was filed on 18th May 1988 and in European Patent Application Publication No. 0 181 578. 
Purification and isolation of the desired protein may also be carried out using the techniques or suitable 
so modifications thereof described in the foregoing patent applications for epidermin and gallidermin, including 
the use of adsorbents, ion -exchange resins and if desired HPLC. 

The process of the invention can be applied to the formation of novel compounds for experimental 
purposes, or to the formation of known compounds or derivatives of known compounds in new hosts. For 
instance a plasmid containing the gene coding for epidermin can be used to transform the species 
55 Streptococcus factis to produce epidermin from that host, or the gene coding for Gallidermin (see our 
co -pending British Patent Application referred to above) can be used to replace the gene coding for the 
pro -polypeptide for epidermin in e.g. plasmid pEpi32 and used to transform Staphylococcus epidermis 
DSM 3095 to produce gallidermin from this host. Similarly other biologically active peptide derivatives 
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containing dehydroamino acid residues and/or lanthionine bridges and/or methyllanthionine bridges can be 
produced, such as derivatives of hormones such as human insulin, oxytocin, vasopressin, peptide antibiot- 
ics, hormone inhibitors such as elastase inhibitor and fibrinolytically active agents such as human tissue 
plasminogen activator. Such derivatives, as well as retaining biological activity of the parent compound can 
5 have increased stability and improved half -lives. 

Ideally the DNA coding for the desired pro - polypeptide should include codons for cystein and serine 
and/or for cysteine and threonine for the formation of thioether bridges. 

For relatively short chain polypeptides these respective codons should normally be no more than eight 
and preferably no more than six codons apart, inclusive, although it is envisaged that, depending upon the 
w steric conformation of the final polypeptide molecule much greater spacing is possible. 

In respect of the formation of dehydroamino acids these will usually be derived from serine and 
threonine and, accordingly the DNA coding for the desired pro - polypeptide will include codons for such 
amino acids. 

Amongst the unusual amino acids which may be present in a polypeptide produced according to the 

?5 present invention are, dehydroalanine, 2,3 - dehydro - 2 - aminobutyric acid, meso - lanthionine, (2S, 3S, 
6R) - 3methyl - lanthionine, S - (2 - (Z) - aminovinyl) - D - cystein, lysinoalanine and £ - hydroxyaspartic 
acid; the structure of these residues are shown in Figure 5 

We have unexpectedly found that the multi enzyme complex responsible for the posttranslational 
modification of pre -epidermin is located on the 54kb plasmid pTQ32 of Staphylococcus epidermis Tu 

20 3298'DSM 3095 Plasmid pTU32 is sometimes also referred to herein as pEpi32. 

The six genes (ORFs) responsible for the production of epidermin are designated herein epi A, B, C, D, 
Q and P and are clustered within 8kb and the proteins for which they code are designated Epi A, B, C, D, Q 
and P respectively; epi A encodes the 52 amino acid -long preepidermin. As described below, epi B, C and 
D are involved in the four enzymatic modification reactions (i) water elimination by a serine/threonine 

25 dehydratase, (ii) sulfur addition by a lanthinonine synthase, (iii) C- terminal decarboxylation by a cysteine 
decarboxylase and (iv) double bond formation. Epi P protein is believed to be responsible for cleaving the 
mature epidermin from the N -terminal leader peptide, based on its striking homologies with the essential 
domain of serine proteases (Koide et a/., J. BacterioL 767:110-116 (1986); Meloun et a/., FEBS Lett. 
733:195 - 200 (1985); and Stahl et a/., J. Bacterid 755:411 -418 (1984)) whilst Epi Q is believed to be a 

30 regulatory protein regulating epidermin biosynthesis, based on its distinct homology to the pho B gene of E. 
coli (Makino et aL, J. Moi Biol. 790:37 - 44(1986)), the fact that both proteins are of a similar size with 205 
(epi Q) and 229 (pho B) amino acid residues, the observed homology of 24.2% extending over the 153 C- 
terminal amino acid residues and the hydrophilicity plots of both proteins. 

As a result of the unexpected finding of the entire genetic information for the epidermin biosynthesis 

35 and the elucidation of the genes for the proteins epi B, C, D, Q and P, it is now possible to obtain the 
isolated DNA coding for the proteins, and to construct plasmids containing one or more of these genes so 
that upon cultivation of a host containing such plasmids one of these proteins alone or predetermined 
combinations of the proteins may be expressed and subsequently isolated. 

The present invention therefore includes DNA sequences encoding respectively for the protein Epi B or 

40 Epi C, or Epi D, or Epi P or Epi Q. These sequences may be isolated DNA either single or double stranded, 
obtained by cleavage of and isolation from pTG32 in known manner or obtained by chemical synthesis or 
any other conventional procedure. The DNA may also be integrated in a plasmid, suitably an expression 
plasmid and under the control of a promoter regulator; such constructs when transformed into a suitable 
host which is then cultivated will express the protein Epi B, Epi C, Epi D, Epi P or Epi Q or combination of 

45 these proteins according to which DNAs were ligated into the plasmid. Alternatively plasmid pTQ32 may be 
treated with suitable restriction nucleases to excise one or other of the DNA sequences, followed by 
religation after any necessary modification of the free ends of the digested plasmid, so as to create a 
modified plasmid containing DNA sequences coding for predetermined ones of epi B, C, D, P and Q. 

A further variant comprises the substitution of the gene coding for epidermin in pTG32 with a DNA 

so sequence coding for a predetermined amino acid sequence whereby cultivation of a suitable host with the 
modified plasmid will result in expression of a protein different from epidermin. 

It is thus possible to substitute a DNA sequence encoding for gallidermin or mutant epidermin or other 
lantibiotic or other protein, for the epidermin coding sequence in pTG32 whereby the resulting plasmid can 
be transformed into a suitable host which may be a host normally incapable of producing a lantibiotic or any 

55 of the proteins Epi B, C, D, P or Q and to cultivate the host under conditions whereby the substituted DNA 
sequence and the genes epi B, C, D, P and Q are expressed, so as to obtain a protein which is gallidermin, 
mutant epidermin or other protein containing at least one structural feature of a lantibiotic. 
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Alternatively the genes coding for the proteins Epi B, C, D, P or Q may be inserted into a suitable 
vector, together with a DNA sequence encoding a predetermined amino acid sequence, the genes coding 
for the Epi proteins and the predetermined amino acid sequence being operably connected with suitable 
promoter regulator functions, the resulting plasmid being transformed into a suitable host whi^h may be a 

5 host normally incapable of producing a lantibiotic or any of the proteins Epi B, C, D, P or Q, and the host 
cultivated so that the inserted genes cause the expression of a protein derived from said predetermined 
amino acid sequence but containing a lantibiotic structural feature, which protein may be gallidermin, 
epidermm, mutant epidermin, or another protein. 

The present invention thus also includes within its scope DNA sequences capable of hybridizing, 

io preferably under stringent conditions, with the DNA sequences described herein and coding for proteins 
having substantially the activity of the proteins Epi B, C, D, P or Q. Stringent hybridization conditions select 
for DNA sequences of greater than 85% or, more preferably, greater than about 90% homology. Screening 
of the cDNA library may be carried out under highly stringent conditions according to the method described 
in European Patent Application No. 88 119 602.9 and Kashima et al . ( Nature 313:402-404 (1985)). The 

75 DNA sequences capable of hybridizing under stringent conditions with the DNA sequences disclosed in the 
present application may be, for example, allelic variants of the disclosed DNA sequences, may be naturally 
present in the particular microorganism but related to the disclosed DNA sequences, or may derived from 
other sources General techniques of nucleic acid hybridization are disclosed by Maniatis, T. et al ., In: 
Molecular Cloning, a Laboratory Manual, Cold Spring Harbor, NY (1982), and by Haymes, B.D. et al ., In: 

20 Nucleic Acid Hybridization, a Practical Approach, IRL Press, Washington, DC (1985), which references are 
herein incorporated by reference. The proteins Epi B, C, D, P and Q are valuable and interesting new 
reagents potentially useful in the preparation of novel proteins or other substances containing structural 
features such as dehydroalanine, dehydrobutynine, meso - lanthionine, 3 - methyl - lanthione, and S-(2- 
aminovinyl) - D - cysteine. 

25 As such, they may be utilized as isolated proteins, or as chemical catalytic reagents in chemical 

synthesis procedures to investigate the extracellular processing of proteins by such enzymes. 

The invention also relates to the proteins Epi B, C, D, P and Q in substantially pure form. By the term 
"substantially pure" is intended that the protein is free of the impurities that are naturally associated 
therewith. Substantial purity may be evidenced by a single band by electrophoresis. 

30 The polypeptides of the invention may be isolated and purified from the above -described recombinant 

molecules in accordance with conventional methods, such as extraction, precipitation, chromatography, 
affinity chromatography, electrophoresis, or the like. Preferably, the polypeptides are produced as part of a 
fusion protein which further comprises an auxiliary protein. Such auxiliary which facilitates the isolation and 
purification of the polypeptide of interest. Such auxiliary proteins include, for example, typical secretion 

35 signals, the maltose binding protein from E. coli or protein A. Methods for preparing fusion proteins 
comprising protein A, the purification thereof by immunoaffinity chromatography, and the cleavage thereof 
to release the protein of interest is taught, for example, in PCT Application Publication No. WO84/03103 
(1984). 

A necessary condition to permit cleavage of the fusion protein is that it contains a unique cleavage site 
40 which may be recognized and cleaved by suitable means. Such a cleavage site may be a unique amino 
acid sequence recognizable by chemical or enzymatic means and located between the desired protein and 
the auxiliary protein. Such a specific amino acid sequence must not occur within the desired protein or 
auxiliary protein. Examples of enzymatic reagents include proteases such as collagenase which may 
recognize the amino acid sequence NH 2 - Pro- X- Gly - Pro- COOH, wherein X is an arbitrary amino acid 
45 residue, e.g. leucine; chymosin (rennin) which cleaves the Met-Phe bond; kallikrein B which cleaves on the 
carboxyl side of Arg in X-Phe-Arg-Y; enterokinase which recognizes the sequence X - (Asp) n - Lys - Y, 
wherein n = 2-4, and cleaves it on the carboxyl side of Lys; thrombin which cleaves at specific argmyl 
bonds. Examples of chemical agents which may be used to cleave the fusion proteins include cyanogen 
bromide which cleaves after Met; hydroxylamine which cleaves the Asn-Z bond wherein Z may be Gly, 
50 Leu or Ala; formic acid which in high concentration f 70%) specifically cleaves Asp -Pro. 

Without further elaboration, it is believed that one skilled in the art can, using the preceding description, 
utilize the present invention to its fullest extent. The following preferred specific embodiments are, therefore, 
to be construed as merely illustrative, and not limitative of the remainder of the disclosure in any way 
whatsoever. 
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EXAMPLES 
Example 1 



5 1 . Overproduction of gallidermin 

A DNA fragment containing the open reading frame of gallidermin can be cloned in Staphylococcus 
epidermidis DSM 3095, the epidermin producing strain by using a medium copy plasmid such as pC194, 
pE194, pUB110, pT181 or pMK148 gallidermin. An increase of the gene doses usually correlates with an 
70 increase of product production; the correlation is not necessarily linear. High copy number plasmid 
derivatives of pCl94 or pT181 can be used as cloning vehicles too. 

2. Exchange of leader sequence 

75 The leader - sequence of epidermin corresponding to amino acids - 1 to -30, is involved in the 

secretion of epidermin. The sequence can be used to secrete other peptides in S. epidermidis such as 
gallidermin. 

The leader - sequence DNA can be made portable by inserting respective linkers at the beginning and 
at the end of its sequence. Thus the leader sequence DNA can be isolated in large amounts from the 
20 plasmid and can be inserted at respective positions of other peptides and proteins. The leader - sequence 
DNA can also be produced by chemical synthesis. 

Example 2 



25 Production of Gallidermin using S. epidermis as host 
1 . Preparation of plasmid (see Figure 6) 

a) Plasmid pCUl was prepared by ligating Pst1 digested pCLP100 and Ndel digested pUCi8 using 
30 Klenow as described in the thesis "Molekular genetische Untersuchungen zur plasm idkodierten Arsenit 

und Arsenatrestistent bei Staphylococcon", by Dr. Ralf Rosenstein (available from the Technische 
Universitat, Munich, West Germany). The resulting plasmid was then digested with EcoR1. 

b) Chromosomal DNA was isolated from S. gai/inarum (DMS 4616) and was digested with EcoRl. A 4.7 
kb fragment containing the gallidermin structural gene in a 2.4 kb long sequence between Hindill and 

35 EcoRl restriction sites was isolated using as a primer the sequence. 
5* CAC ATC CAG GAG TAC 3' 

c) The 4.7 kb Fragment was then ligated into the EcoRl site of the digested pCUl plasmid from step a) 
to give a plasmid designated pCUgdml . 

40 2. Preparation of a S. epidermis host 



In this example a mutant strain of S. epidermis DSM 3095 incapable of producing epidermin was 
isolated. 

The mutagenesis was carried out on a strain which was characterized by chromosomally coded 
45 Rifampicin resistance (20 ug/ml). 

S. epidermis DSM 3095 grown on Agar plates was used to inoculate 30 ml basic broth medium which 
was cultivated overnight. 0.5 ml of the overnight cultivation was then used to inoculate 50 ml of production 
medium which was shake cultivated at 37 * C for three hours. 

Cells were removed from the cultivation medium and suspended in 4.5 ml pre - warmed TM - Buffer (30 
so mM Tris-Maleate pH 6.5 (the resulting solution is designated Solution A)). 

The solution A was checked for spontaneous mutations and for cell count (1.25 x 10 10 cells/ml). 

4 ml of solution A was thoroughly shaken with 1 ml ethyl methyl sulphonate (final concentration 47 
ug/ml) and then maintained under shaking at 37 *C for one hour. 

Cells were then extracted from the cultivation broth, washed twice in TM - Buffer and resuspended in 5 
55 ml TM- Buffer (the resulting solution was designated Solution B and contained mutated cells). 

Solution B was found to contain 2 x 10 8 cells/ml which corresponds to survival rate of 1.6%. 

50 ml of solution B was added to 5 ml production medium and grown overnight at 37 'C (phenotypic 
expression). The resulting solution was designated Solution C. A cell count showed 7.3 x 10 8 cells/ml. 
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The solution was plated on BM-Agar plates and individual colonies were picked out. These were used 

to inoculate test plates (consisting of BM-Agar to which Micrococcus luteus has been laid on the 

surface). Those colonies which bad no inhibitory effect on M. luteus were selected as non - producers of 

Epidermin. 
5 BM Agar contains per liter: 

10 gm Peptone No. 140 

5 gm Yeast extract 

1 mg Glucose 

5 mg NaCI 
w 1 mg K 2 HPCU 

pH 7.5 

A mutation rate of about 3% was noted. 

The 45 non - producers which were found were sub -cloned 20 times to yield 16 stable non - 
producers. 

75 All stable non - producers were found to contain the wild type plasmid pEpi32. From the restriction 

pattern this is identified as identical to the plasmid in the wild type strain. 

Transformation of non - producing S. epidermis 

750 ml of BM- medium was inoculated with 5 ml of medium obtained by overnight cultivation of a 
stable non - producing strain, and the inoculated medium was shake cultivated in a 2 liter flask at 37° C with 
a shake speed of 120 rpm. 

The initial optical density of the inoculated BM- medium was 0.03-0.04. When the optical density had 
reached 0.45-0.55 the cells were removed by centrifugation in a GS. -3- Rotor at 8500 rpm for 15 
minutes at 4'C. The isolated cells were then washed successively in 750, 350, 40 and 10 ml of 10% 
glycerin, suspended in 2-3 ml 10% glycerin, and frozen in 110 ml portions in ERGs at -70°C. The cell 
count amounted to 1 - 5 x 10 10 /mL 

The frozen cells were thawed at room temperature for 5 minutes, then 50 ul of cell suspension was 
incubated in an ERG with 2 ul plasmid pCUgdml in TE- Buffer for 30 minutes at room temperature. 

The mixture was then introduced into an electroporation cuvette having a 0.2 cm electrode gap and 
immediately electroporated. Thereafter the cells were rapidly resuspended in 950 ul SMMP50 - medium, 
transferred into a 2.5 ml ERG and shaken for 90 minutes at 37 *C. The ERGs were inclined at 45° in order 
to provide for a good aeration of the medium. 

SMMP50- medium contains pro 100 ml, 55 ml 2SMM, 40 ml 4 PAB and 5 mol 5% BSA. The 2SMM 
contains 1 mol saccharose, 0.04 mol maleic acid, 0.04 mol MgCI 2 and NaOH to pH 6.5 4 PAB is a solution 
of 7 g/100 ml of Gibco antibiotic medium 3. 

The cell suspension is diluted and spread on a BM - Agar containing gallidermin which is incubated for 

20 hours at 37 °C. 

Testing of growing strains which produce gallidermin was carried out by selection of colonies from a M. 
luteus test plate and by cultivating the respective selected colonies and determining the presence of 
gallidermin by HPLC. 

Three pCUgdml transformed mutants capable of producing gallidermin were located. 
Determination of the presenc e of gallidermin produced by pCUgdml transformed S. epidermin 

45 

a) Bio assay 



FP-Agar was inoculated with M. luteus ATCC 9341 and incubated at 37 *C for 18 hours. Half of the 
produced culture was removed with a loop and suspended in 100 ml FP- medium and was cultivated for 8 
50 hours at 36 *C The cultivation was stopped when the optical density reached 1.0. FP-Agar was inoculated 
with 0.5% of this suspension, each 10 ml was poured into a Petri dish and stored for 3 weeks at 4" C. 

The Plate diffusion test was carried out as described in Zahner and Maas, "Biology of Antibiotics", 
Springer Verlag, Berlin 1972. 10 ul of culture filtrate from cultivation of the transformed S. epidermin was 
captured on a filter paper and dried. The paper was placed on the test plate which was then incubated for 
55 24 hours at 37 ' C. 
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b) HPLC 

The selected transformed strain was cultivated for 26 hours in the production medium. The culture broth 
was centrifuged for 10 minutes at 13.000 rpm. 

The isolated culture liquid was then subject to HPLC on a SP 8.700 liquid chromatography apparatus 
(Spectra Physics, Darmstadt, FRG) using as the mobile phase A) H 2 0 with 0.5% 70% perchloric acid and 
B) Acetonitrile. Column packings were Nucleosil - 100 C- 18 of grain size 7 urn and column sizes 125 mm 
x 4.6 mm I D. and 20 mm x 4.6 mm ID for the pre -column. 

Gradients were as follows: 



75 



time (min.) 


A [%] 


B [%] 


0 


77.5 


22.5 


8 


63.0 


37.0 


8.5 


0 


100 


9.5 


0 


100 


10 


77.5 


22.5 


14 


77.5 


22.5 



The resulting chromatogram is shown in Figure 7A. A standard curve is shown in Figure 7B showing 
that galiidermm elutes at 7.54 minutes. 

The following were used as culture medium. 



1 . FP - Agar 


Meat extract 




Peptone 


10 g 


NaCl 


3g 


Na 2 HPO* 


59 


Glucose 


10 g 


Complex agar 


15 g 


Water 


1 liter 


pH 


7.2 



2. FP - Medium 


Meat extract 


4g 


Peptone 


10 g 


NaCl 


3g 


Na 2 HPO* 


5g 


Glucose 


10 g 


Water 


1 liter 


PH 


7.2 



3. Production medium 


Meat extract 


33 g 


Malt extract 


30 g 


NaCl 


40 g 


Calcium Hydroxide 


3.8 g 


Water 


1 liter 


PH 


6.5 
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Example 3 



Plasmid Isolation 



5 Plasmid DNA from S. epidermis TQ3298 was isolated according to a modified procedure of Norick et 

al.,Ann. NY- Acad. Sci. 182:279-294 (1971). S. epidermis was grown on BM - media (1% peptone 140, 
Gibco, Neu-lsenburg, F.R.G., 0.5% yeast extract, Difco, Detroit, USA, 0.1% glucose, 0.5% NaCl and 0.1% 
K2HPO4 x 2H 2 0) until stationary phase. Cells were centrifuged and washed twice with 0.5 M EDTA. The 
pellet was resuspended in 80 ml NaCl buffer (50 mM Tris/HCI, pH 7, 50 mM EDTA, 2.5 M NaCl), 1.5 ml 

70 lysostaphin solution (0.5 mg/ml. Sigma, Heidelberg, F.R.G.) was added and the suspension was incubated 
at 37° C for 20 min. Cells were lysed by the addition of 80 ml lysis buffer (50 mM Tris/HCI, pH 8, 300 mM 
EDTA, 500 mM Brij., 40 mM sodium deoxycholate and kept on ice for 1 h. The lysate was centrifuged (30 
min, 13,000 rpm, 4°C) and the supernatant was mixed with one quarter of its volume with 50% solution of 
PEG -6000. Plasmid DNA was precipitated at 4*C overnight. The DNA suspension was centrifuged (20 

75 min, 13,000 rpm, 4 # C), resuspended in 8 ml TE buffer and 50 ul of proteinase K solution (20 mg/ml) was 
added. After incubation at 37* C for 15 min the DNA was precipitated with ethanol and further purified by 
CsCI centrifugation (1 g CsCI/ml, 40,000 rpm, 40 h, 20 * C). 

RNA isolation and electrophoreses 

20 

S. epidermin was grown on SMS minimum medium (Terzaghi et al., Appl Microbiol. 29:807-813 (1975)) 
and RNA isolated therefrom, using a modified procedure similar to that described for Baciiius subtilis RNA 
(Ulmanen et a/., J. Bacterid. 762:176-182 (1985)). Cells were lysed with lysostaphin (0.1 mg/ml) in 
protoplasting buffer and incubation was performed at 37* C. Total RNA was glyoxylated (McMaster et al., 
25 Proc. Natl. Acad. Sci. USA 74:4835-4839 (1977)) and separated on a 1.2% agarose gel using 10 mM 
Na?P0*. pH 7, as electrophoresis buffer. RNA was stained with ethidium bromide and blotted to a 
nitrocellulose membrane (Scheider and Schuell, Dassel, F.R.G.) by capillary transfer with 20xSSC buffer 
(0.15 M NaCl, 0.015 M tri sodium citrate, pH 9). 23SrRNA and 16SrRNA were used as size standards. 

30 In vitro transcription 



Single stranded RNA probes were obtained by cloning the respective fragment in a pSPT18/l9 vector 
system (Boehringer Mannheim, Mannheim, F.R.G.). The plasmids were linearized with EcoRI or Hindlll to 
get a linear DNA template. For transcription the protocol in Melton et al., Nucl. Acid Res. /2:7035-7056 
35 (1984), was modified according to the instructions of the commercial supplier. T7-RNA polymerase or 
SP6-RNA polymerase was used in the presence of a 22 P-CTP (800 Ci/mMol). Unincorporated 
ribonucleotides were separated from labeled RNA by Sephadex G50 chromatography. 

Northern hybridization 

40 

RNA was transferred after electrophoresis according to Thomas, P.S., Proc. Natl. Acad. Sci. USA 77: 
5201 -5205 (1980). After 2 h incubation at 80 °C the filter was shortly incubated in 20 Tris/HCI, pH 8, at 
100° C to reverse glyoxylation. Afterwards filters were prehybridized at 42 *C in 50% formamide, 5xSSC 
(0.15 M NaCl, 0.015 M tri sodium citrate, pH 9), 50 NaPO*. pH 6.5, 0.1% ficoll 400 (Pharmazia, Freiburg, 

45 F.R.G.), 0.1% polyvinylpyrollidone. 0.1% bovine serum albumin and 0.25 mg/ml denatured salmon sperm 
DNA for 2h. After probe addition hybridization was performed in the same buffer at 42 *C for 12 h. Filters 
were washed once in IxSSC, 0.1% SDS at 42 *C for 15 mm and exposed to Kodak -X Omat films at 
-70*C for 4 h. Thereafter filters were washed twice with 0.5 SSC, 0.1% SDS at 70 *C for 15 min and 
autoradiograms were exposed at -70"C for 16 h. Next day washing was continued with 0. IxSSC, 0.1% 

50 SDS at 70 ' C for 30 - 60 mm and afterwards again exposed to Kodak - X Omat films at - 70 * C for 3 days. 

Southern hybridization 



For southern hybridization (Southern, E.M., J. Mol. Biol. 35:503-517 (1975)) 5' labeled oligonucleotides 
bb were used as probes at 23 °C. Oligonucleotides were labeled with gamma 32 P-ATP using 4T polynucleotide 
kinase (Boehringer Mannheim, Mannheim, F.R.G.). Oligonucleotides and primers were synthesized on a 391 
DNA synthesizer (Applied Biosystems, Weiterstadt, F.R.G.) and used without further purification. 
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DNA sequencing 

DNA was sequenced radioactively and non - radioactively by the chain termination method (Sanger et 
a/ , Proc. Natl. Acad. Sci. USA 74:5463-5467 (1977)) using T7 - DNA polymerase (Pha mazia, Freiburg, 

5 F.R.G.). Radioactive plasmid sequencing was performed as described in Hattori et al., Anal. Biochem. 
1 52:232 -238(1984) with appropriate primers. The 3.6 kb BamHI/Pstl fragment was sequenced non- 
radioactive^ on an Applied 373A DNA sequenator (Applied Biosystems, Weiterstadt, F.R.G.). The respec- 
tive fragment was cloned in phagemid pBSK-/ + . The construction was digested with BamHI and Sacl and 
the linearized DNA was unibidirectionally digested from the 5* end with exonuclease III (Boehnnger 

w Mannheim, Mannheim, F.R.G.) to obtain a set of nested deletions which were treated with mung bean 
nuclease (Boehringer Mannheim, Mannheim. F.R.G.) to receive blunt ends. After electrophoresis (1% 
agarose gel) fragments of appropriate size were isolated from the gel, religated and transformed into E. coli 
strain XL-1 Blue. Single stranded DNA was isolated by using helper phage CSM13 and sequenced with 
Taq Polymerase (Promega, Freiburg, F.R.G.) according to the protocol of the commercial supplier. 

15 

Plasmid Construction 



The staphylococcal tetracycline resistance plasmid pT18l has been sequenced (Kahn et a/., Plasmid 
70:251 -259 (1983)) and found to contain a single Ndel site within the pre -gene which is not necessary for 
20 plasmid replication (Gennaro et al , J. Bacteriol. 769:2601 -2610 (1987)). The multiple cloning site (mcs) of 
the E. coll vector pUCI9 (Yanisch - Perron et al.. Gene 33:103-119 (1985)) was inserted into the Ndel site 
to form pT18lmcs (see Figure 14). 

A staphylococcus - E. coll shuttle vector, pCUl (Figure 10) was constructed from pCLP100, a derivative 
of the staphylococcal chloramphenicol resistance plasmid pC194 (Horinouchi et al, J. Bacteriol. 750:815- 
25 825 (1982)) and the E. coli vector pUCI9. PCUl is stably maintained in both hosts with an insert size up to 
approximately 6 kb. pT18lmcs and pCUl are compatible in staphylococci and were used to subclone DNA 
fragments from pTG32. 

A Hindlll fragment of pTu32 was cloned in pUCI9 and used as a probe in Southern hybridization to 
identify further restriction sites near the Hindlll fragment (Figure 8C). 
30 The 13.5 kbp Bglll fragment of the 54 kbp episomal element pTG32 from S. epidermis was subcloned 
in pT18lmcs to yield pTepi!4 (Figure 8A). For DNA sequencing subclones were made in the E. coli vector 
pUCI9 (Yanisch - Perron et a/., Gene 33:103-119 (1985)) and pBluescript ll R (Stratagene, Heidelberg, 
F.R.G). Single stranded RNA probes were obtained from DNA cloned in vector pSPTI8/!9 (Boehringer 
Mannheim, Mannheim, F.R.G.). 

35 

Gene Analysis 

Sequencing the DNA region adjacent to the epidermin structural gene, epi A, revealed five additional 
complete open reading frames epi B, C, D, P and Q inside the 13.5kbp Bglll fragment of pTu32. 
40 As can be seen in Figure 9, directly adjacent to the sequence encoding for EpiA separated by only 50 
nucleotides from the epiA ochre codon there is a large open reading frame preceded by a S/D sequence 
which spans 2,970 bp. A TTG codon for leucine which can also act as a translation start codon in 
staphylococci is in appropriate distance (86p) to a S/D sequence. This open reading frame is designated 
epiB and as described herein can successfully be used for the complementation of epidermin biosynthesis 
45 mutants and an essential role in epidermin biosynthesis. 

The protein coded for by epiB, starting from the TTG (Leu) has a molecular weight of about H5kDa, a 
net charge of -3 at pH7, and is moderately hydrophobic (41% hydrophobic residues) as may also be 
predicted from a hydrophilicity plot according to Kyte et al., J. Mol. Biol. 757:105-132 (1982). 

At the 3* end of epiB no palindromine structure characteristic of transcription termination can be seen, 
so There is, however, a I22bp overlap with an other reading frame epiC, shifted by - 1 base pair also to be 
seen in Figure 9. 

We have established this to be no artefact by independently cloning and sequencing the respective 47 
kbp Hindlll -fragment twice from two independent plasmid isolations. This was also confirmed by mutant 
complementation with an epiC containing fragment as described herein. 
55 Inside the overlapping region of epiB and epiC reading frames the first TTG codon (Leu) which is only 

36bp 3* to an AGGA element serves as a translational start codon, indicating that both reading frames 
overlap by about 40 codons. The actual amino - terminus of the EpiC protein was determined by N - 
terminal sequencing. Reading frame epiC encodes a protein with 445 amino acid residues commencing with 
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starling codon TTG (Leu). The reading frame epiD directly follows 3* to epiC with a start ATG 86p 3' to a 
AGGAGG S/D sequence. 3* to epiD is a classical rho dependent transcription terminator structure; epiD 
encodes a protein of 181 ammo acid residues with ATG (Met) on start codon. 

None of the proteins EpiB, C, D, P and Q show any similarity with protein sequences filed in the protein 
data bases Swiss Prot and Gene Bank, and thus represent unknown types of enzymes and regulatory 
proteins. 



Transcription of the biosynthetic genes 

Single stranded RNA probes were obtained by cloning the desired fragment in a pSPT 18/19 vector 
system (Boehnnger Mannheim, Mannheim, F.R.G.) as described above. 

Two transcripts differing considerably in size were obtained as illustrated in Figure 10. A hybridization 
probe specific of epiA identified a small transcript of about 300bp. Transcripts of similar size were also 
found for the lantibiotics nisin (Buchmann et a!., J. Biol. Chem. 253:16260-16266 (1988)) and subtilin 
(Banerjee et a/., J. Biol. Chem. 263:9508-9514 (1988)). Additionally a large transcript of approximately 5kb 
can be identified with a hybridization probe specific for epiB. As there were no E. coii- like promoter 
sequences in front of epiB, whereas appropriate sequences were located 5' to epiA it can be seen that the 
epiA promoter acts as a promoter for a polycistronic mRNA. 



20 Downstream open reading frames 

The open reading frames epiP and epiQ are located on the opposite DNA to epiB, C and D with epiQ 
sharing a termination structure with epiD a perfect hairpin with a 6bp loop. 

Exactly within this loop structure the TAA stop codons for both reading frames epiD and epiQ share two 
25 of three nucleotides. 

The epiP reading frame starts with an ATG codon which is in appropriate distance (6 bp) to a S/D 
sequence. Taking the ATG codon as the translational start of epiP a protein of 461 amino acid residues with 
molecular weight of 5l.8kD. epiP shares characteristic homologies with the conserved amino acid motives of 
serine proteases (see Figure 11) indicating that epiP is implicated in cleaving the mature lantibiotic from the 
30 modified prepeptide. 

The epiQ reading frame also starts with an ATG codon and encodes 205 amino acid residues (Figure 
9). A S/D sequence is present 6 bp distance to the ATG codon and a molecular weight of 243 kD can be 
deduced from the DNA sequence. The epiQ protein shares characteristic homologies with PhoB (see Figure 
12) which is a positive regulatory factor for the phosphate regulatory of E. coii so that epiQ is implicated as 
35 a regulatory factor in lantibiotic synthesis. 

Preceding epiP is an E. coii- like -10 region (5* - TATAAA) I2bp in front of the S/D sequence which 
may serve as a promoter in staphylococci. The distance between the epiP stop codon and the ATG start 
codon of epiQ is only 10 nucleotides and the epiQ S/D sequence overlaps with the epiP termination codon 
as shown in Figure 9. 

40 5 ? to epiA, B, C, D a further reading frame with opposite orientation can be seen which potentially 
encodes a maximum of 148 amino acids. A characteristic S/D sequence is present but none of the 
previously described start codons for staphylococci (ATG, TTG, GTG). With a -1 frame shift a further 
reading frame follows which exceeds the isolated Bglll fragment illustrated in Figure 9. 

These two reading frames are homologous to a single open reading frame, gdm Y, identified adjacent to 

45 the structural gene of gallidermin (Schnell, N.. Biosynthese der Peptid- Antibiotika Epidermin und 
Gailidermm, Doctoral Thesis, University of Tubingen, F.R.G. (1989)). The homologous reading frames on 
the S. epidermis plasmid are designated epiY' and epiY". 



Example 4 

S. carnosus TM300 was transformed with the plasmid pTepil4, prepared as described above, using 
standard techniques. The transformed strain was then grown on BM - media (see above). 

The resulting transformants were found to be capable of inhibiting the epidermin sensitive tester strain 
Micrococcus luteus ATCC934I. In this assay 1 ml of an overnight culture of M. luteus (adjusted to an 
OD573 of 1.0) was added to 500 ml molten BM-Agar. Petri dishes usually contained 10 ml of this agar. 
Dilutions of S. epidermis cultures were spread on the agar surface. Epidermin positive colonies were 
detected as a zero of growth inhibition of M. luteus around the colonies. 
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Cells were grown on 3% meat extract, 3.8% malt extract, 0.6% CaCI 2 x2H 2 0 and 4.6% NaCI, pH6.5. 
According to the transformation used, tetracycline or chloramphenicol was added. After 24 h incubation 
(37 8 C, 160 rpm) in 500 ml Erlenmeyer flasks with one extension containing 100 ml medium, the culture 
both was centrifuged at 10,000 rpm in a Servall centrifuge for 10 min. 

Supernatants of liquid transformant cultures were purified by adsorption chromatography (XAD1180, 
impurities eluted with water/methanol (1:1) and epidermin eluted with methanol/0.1 N HCI (9:1); after 
evaporation the eluate was adjusted with 3N NaOH to pH 3.5 and filled up with water to 10 ml) and detected 
by HPLC chromatography. The inhibitory activity co- migrated with mature epidermin at 6.75/6.76 min (see 
Figures 13A and 13B). Untransformed S. carnosus culture media treated similarly had no peak in this 
elution region (6.72 to 6.79 min, Figure 13C). These results clearly confirmed the heterologous epidermin 
biosynthesis in S. carnosus and demonstrated that pTepil4 contains all information necessary for epidermin 
biosynthesis. 

As pTepi!4 contains the I3.5kbp Bglll fragment this indicates that the epiY' and epiY" reading frames 
are not necessary for the production of epidermin in this system as epiY' lacks a translational start codon 
and epiY'* is incomplete on this fragment. 

Example 5 



A number of epi"mutants of S. epidermin TG3298 were prepared by ethylmethane sulfonate (EMS) 
mutagenesis. This procedure was carried out according to Miller, J.H., Experiments in molecular genetics. 
Cold Spring Harbor Laboratory; Cold Spring Harbor, N Y. (1972). The mutants were screened for epidermin 
production, or lack of epidermin production using the M. luteus assay described above. Epi" mutants were 
transferred several times to test their stability. Of the 40 epi" mutants isolated, only 10 were stable; the 
unstable mutants produced epidermin again after several transfers. All stable epi" mutants still contained 
plasmid pTu32 which suffered no deletions or rearrangements as tested by restriction endonuclease 
analysis. The 10 epi" mutants were used for complementation studies. 

Various restriction fragments of plasmid pTU32 were cloned in $. carnosus to test for heterologous 
epidermin production. The fragments were inserted into plasmid vectors T181mcs and pCU1 as described 
above and the various ORFs which were subcloned as shown in Figure 16B. 

Cloning was first carried out in S. carnosus (by protoplast transformation (Gotz et al., FEMS Microbiol. 
Lett. 40:285-288 (1987)) or E. coli (using CaC1 2 ; Cohen et al., Proc. Nat. Acad. Sci. USA 69:2110-2114 
(1972)) and then the recombinant plasmids were isolated and transferred into the various S. epidermis epi 
mutants by electroporation (Augustin et al., FEMS Microbiol. Lett. 66:203-208 (1990)). Enzymes used for 
molecular cloning were obtained from Boehringer Mannheim (Mannheim, F.R.G.), BRL (Eggenstein, F.R.G.) 
or Pharmacia (Sweden). This indirect transformation method was necessary since transformation of S. 
epidermin strains was only successful with circular covalently closed (ccc) plasmids; when ligation products 
were used, transformants could only be isolated occasionally. 

The results of the complementation studies are summarized in Table 1 . 
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A series of plasmids were constructed which carry various epi genes (A, B, C, D, P and Q) (Figure 
16B). Two plasmids pTepi!4 and pTepiABCDQ were able to complement all epi"mutants. The other 
constructed plasmids pCUepiABC, pTepiAB, pCUepiCDQ, pCUepiB, pCUepiA, , pCUepiA 2 , pCUepiDQ and 
pCUepiQ contained the indicated genes. 
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The various plasmids were able to complement only certain classes of mutants which are classified 
herein as follows: 
EMS 5 and 6 - epiA mutants, 
EMS 18, 33 and 45 - epiB mutants, 
5 EMS 12, 13, 19 and 39 - epiC mutants, 
EMS 1 1 - epiD mutant. 

The results as shown below indicate at least that the four ORFs epiA, B, C and D are required for 
epidermin biosynthesis. 

The plasmid pCUepiAi carries the structural gene epiA as the only complete ORF and an additional 
w 1400bp upstream and 602bp downstream, the latter encoding 190 amino acids of the epiB N - terminus. 
Transformation using pCUepiAi resulted in the complementation of the epidermin mutants EMS 5 and 6 
identifying them as epiA mutants. The smaller epiA -containing Seal fragment cloned in both orientations in 
pCUepiA 2 failed to complement the epi" mutants as the epiA promoter was cut by this enzyme. 

pCUepiB carries a BstN1 fragment containing the complete epiB and an upstream region of 100bp 
75 which includes 75bp of the 3' terminus of epiA; the epiA promoter is missing. Transformation with pCUepiB 
failed to complement any S. epidermis mutant to epidermin production, indicating that epiB lacks its own 
promoter and is very likely co -transcribed from the epiA promoter. 

This is in agreement with the results obtained with pTepiAB (Figure 16B; Table 1) which contains epiA 
promoter and the complete epiA and B genes and the use of which complements both the epiA and epiB 
20 mutants. 

Plasmid pCUepiCDQ was able to complement both epiC and epiD mutants and plasmid pCUepiDQ was 
only able to complement the epiD mutant (Table 1). The complementation was independent of the 
orientation of the cloned DNA fragment. These results show that both epiC and epiD possess their own 
promoters. 

25 

Example 6 

The epiA mutated pTQ32 derivatives were isolated from EMS 5 and 6 and the respective epiA ORFs 
were sequenced. Both plasmids had point mutations within epiA; in the EMS 5 plasmid the codon AGT 
30 (Ser 3 ) was changed to AAT (Asn 3 ) and in the EMS 6 plasmid the codon GGA (Gly 10 ) was changed to GAA 
(Gin 10 ); both these mutations were located at crucial sites within the unmodified epidermin. 

Example 7 

35 An epiB (on a BstN1 - fragment) was put under the control of the promoter on plasmid pPS4 (Figure 
17). The resulting plasmid pPS4epiB was able to complement the epiB mutants EMS 18, 33 and 45. A 
plasmid containing epiB in the opposite orientation did not complement the mutations. This also establishes 
that pCUepiB was unable to complement any of the EMS mutants, because the epiA promoter is missing. 

40 Example 8 

As described above, the presence of pTepi4 (Figure 16A) resulted in epidermin biosynthesis in S. 
carnosus: however, the presence of pTepiABCDQ did not. The minimum size of DNA required which leads 
to heterologous epidermin expression in S. carnosus was determined by complementing S. carnosus - 

45 (pTepiABCDQ) with distally located DNA fragments (Figure 18). Transformation of S. carnosus - 
(pTepiABCDQ) with plasmids pCA44 - 90, pCA44-91 and pCA44-92 led to epidermin production, 
pCA44-92 containing the complete epiQ and epiP ORFs consisted of the smallest DNA fragment able to 
complement epidermin production. These results indicate that the epidermin biosynthetic genes are 
clustered within an 8kb DNA fragment containing the six ORFs; epiA, B, C, D, Q and P and that no other 

so genes are involved in epidermin biosynthesis. 

In these examples staphyloccal plasmid DNA was prepared by the cleaved lysate method (Makmo ef 
a/., J. MoL Biol. 190:37 - 44 (1986)). Cells were lysed by the addition of lysostaphin (8 ug/ml) and the DNA 
was isolated by CsCI - centrifugation. E. coli supercoiled plasmid DNA was prepared by the modified 
alkaline lysis method (Birnboim et al., Nucl. Acid Res. 7:1513-1518 (1979)). 

55 The DNA sequence of the PCR -amplified epiA - containing fragment and the two mutated epiA regions 

of the S. epidermis mutants, EMS 5 and 6, was determined by double - stranded DNA sequencing using 
the dideoxy procedure (McMaster et al., Proc. Natl, Acad. Set. USA 74:4835 - 4839 (1977)), the 
"sequence" list of Pharmacia and (a- 3S S)-dATP from Amersham. Primers used for DNA sequencing and 
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PCR amplification were synthesized using the DNA - synthesizer of Applied Biosystems. The sequences of 
the two primers for PCR amplification of epiA are as follows: 

a) 5' - GGGTTTTAGG(TA)ATCCTTTTTAATAAATTTTTAGGAG -3' 

b) 5' - CCTCAAAATTAAGACG(A)GAT(G)CCTCTATTGAAGCCC - 3' 

5 Primer a) binds in front of the RBS of epiA and primer b) after the epiA stop codon. These bases 

indicated by bold letters represent (shown in brackets) used to create BamHI sites in front and at the end of 
epiA; the epiA promoter is absent in the amplified DNA fragment. 

For determination of the DNA sequence of the mutated epiA in the mutants EMS 5 and 6, plasmid 
pTu32 was isolated and the DNA region was amplified by PCR using another set of DNA primers binding 
70 upstream of the postulated epiA promoter region (5* - GGTTTGGTTATTTTCC - 3') and downstream of the 
stop codon (5' - CCTCAAAATTAAGACAGAGCCTC - 3'); the DNA sequence of epiA is also shown in 
Schnell et a/., Nature (Lond.) 333:276-278 (1988). 

Example 9 

The epi D gene was isolated from the plasmid pTepi 14, multiplied by PCR amplification and cloned 
into the Stul- restriction site of vector plH902 (New England, Biolabs) by "blunt end" ligation, with the result 
that the epi D gene is fused without any intervening base pairs immediately at the Factor Xa- cleavage site 
of vector plH902, which was then transformed into E. coli 

Cultivation of the E. coli resulted in expression of the enzyme Epi D fused to the Maltose binding 
protein of E. coli. The resulting fusion protein was purified by affinity chromatography on Amylose column 
material. 

It was found that the enzyme epiD could be cleaved from the fusion protein in low yield by means of 
Factor Xa. A modification of the amino acid sequence at the cleavage region will enable the cleavage rate to 
be improved. 

The fusion protein was sequenced at the DNA level from the fusion position to the 3' end of epiD. The 
epiD sequence corresponded to the wild type sequence of S. epidermis. 

Plasmid pCA44 is deposited in Staphylococcus carnosus TM 300 at the DSM Deutsche Sammlung von 
Mikroorgamsmen und Zellkulturen GmbH, Mascheroder Weg lb, D-3300 Braunschweig, Germany, under 
No. 6863, the deposit being dated 23rd December 1991. 

Plasmid pPS4 is deposited in Stephylococcus carnosus TM 300 under No. 6864 at the same 
depositary, deposit being dated 23rd December 1991. 

Claims 

1. A recombinant DNA molecule having a sequence corresponding to one or more DNA sequences 
contained in a Bglll fragment of the 54 kbp episomal element pTG32 from S.epidermis and which codes 
for at least one protein having Epi B, Epi C, Epi D, Epi P or Epi Q enzymatic activity; or 
a DNA molecule which is capable of hybridizing to the aforesaid Bglll fragment and which codes for at 
least one protein having one of the aforesaid enzymatic activities. 

2. A recombinant DNA molecule according to claim 1 , which encodes the protein Epi B, Epi C, Epi D, Epi 
P and/or Epi Q having the amino acid sequence(s) set forth in Figure 9. 

45 3. A plasmid comprising a DNA molecule according to claims 1 or 2 under the control of a promoter 
regulator sequence coding for a protein having Epi B, Epi C, Epi D, Epi P or Epi Q activity but wherein 
one or more of the DNA sequences coding for the other said proteins or Epi A are absent. 

4. A protein as defined in claim 1 . 

in 

5. The protein Epi B. Epi C, Epi D, Epi P or Epi Q in the absence of impurities with which it is normaiiy 
associated. 

6. A method of preparing Epi B, Epi C, Epi D, Epi P or Epi Q, which comprises inserting a DNA sequence 
55 as defined in claim 1 or claim 2 into a plasmid vector such that it is under the control of a promoter 

regulator, inserting the resulting plasmid vector into a suitable host, culturing said host so that the 
protein is expressed, and isolating the protein. 
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7. A method of producing a protein having at least one of the structural features of a lantibiotic which 
comprises inserting a DNA sequence coding for a predetermined amino acid sequence in a vector 
plasmid such that it is under the control of a regulator promoter, transforming the resulting plasmid 
vector into a suitable host and cultivating the host so as to express the said DNA and isolating the 
protein, wherein the host is also transformed by a DNA sequence or sequences comprising functionally 
operable genes coding for epi B, C, D, P or Q or a variant thereof having epi B, C, D, P or Q activity, 
which genes are expressed during the said cultivation of the host so that the resulting protein has at 
least one of the structural features of a lantibiotic. 

8. A method according to claim 7, wherein the DNA sequence coding for the predetermined amino acid 
sequence and the genes coding for Epi B, C, D, P or Q or variants are contained in the same plasmid 
vector which is used to transform the host. 

9. A method according to claim 7 or 8 wherein the said DNA sequence contains or is inserted 
immediately downstream of a DNA sequence coding for the pre - peptide sequence - 30 to - 1 of 
pre - epidermin. 

10. A fusion protein comprising one of Epi B, C, D, P or Q and an auxiliary protein. 

11. A fusion protein wherein one of Epi B, C, D, P or Q is fused to an auxiliary protein such that the 
junction between the Epi protein and the auxiliary protein can be cleaved by an enzyme 

12. A fusion protein according to claim 10, wherein said auxiliary protein facilitates secretion of the fusion 
protein from a selected host. 

13. A fusion protein according to claim 10, wherein the auxiliary protein facilitates purification by affinity 
chromatography. 

14. A fusion protein according to claim 10 in which the auxiliary protein is the Maltose binding protein from 
E.coli. 

15. A DNA sequence encoding a fusion protein according to claim 10. 
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FIG. 2B 
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FIG. 7 A 

EMS 5 plus pCuGal ( Gallidermin ) 
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FIG. 7B 

Gollidermin- Standard 
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AGATCTTGTGTTATATAACTAAACAAATTTCTCCATTCGTATTTAGAAAATTGACT 
DQTIYSFLNRWEYKSFQS 

60 80 100 120 

TTT AT C AAGTTT ATC C AAAT AT AT ATTT C C AGT AT ATTCTGT ATTT AA C C C AG CT AAT AT ATTT 
DLKKDLYINGTYETNLGALIN 

140 160 180 

AATAATGTACTTTTTCCACACCCACTTTCACCTATAATATTGTAGATATAACCrTTATGAAGAT 
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200 220 240 

C^AAACTTATAGAATTTATTATTTGTTTATTGTCTTTTGTGAAGTTC 

LSISNI IQKNDKTFNLDNIEM 

260 280 300 

TTTTTGAACAAAGTTATTGTAAGTTGTTTTAATAGTTAAT 
KQVFNNYTTKITLVEEPEKNI 

320 340 360 

TTTAAAATTCTATCTGAAGATCCAATTGCTCGTTGTACTTCCGTCCAATAAGATGTAATAGATA 
KLIRDSSGIARQVETWYSTISV 

380 400 420 440 

CTATTGGATTAATAATTTGAAATAAATATAAAACATAAGCAAACATATCTCCGCTTTTCATCAT 
IPNI IQELYLVYAFMDG SKMM 

460 480 500 

ATTATTTTCCATTAAGTAATAACCCAAAAATAAAATACCAAAAATGTTAATAAATAGAATTAAG 
NNEHLYYGLFLIGFINIFLIL 

520 540 560 

TTCATAATTGGTTCGAAAAAAGATAATACTTTGATCTTATGTAACTC™ 

NMIPEFFSLVKIKHLEI DFINK 

580 600 620 

TTAATAGGGTATAGTTTTTTATTTTTTCGATATTATATGTACTTAAAG 

LLTYNKIKEINYTSLTKILKI 

640 660 680 

TGTAGATAATCTATTACTATAATAAGAAGATAATTTAGCAGTAGCTTCTTGAGATTTACTTGAT 

TSLRNSYYSSLKATAEQSKSS 

700 720 740 760 

ACTCTTTTCATTATATTTCCTATAGGTAGTATTACAATTATCAATATAGGTAATGTACACACTA 

VRKMINGIPLIVIILIPLTCVL 
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FIG. 9 (2/1D 



780 800 820 

AATATAATGTCAAGGTTTTGTTAATTATATATAAAAATATTAGTGATACTATAACTGAAAATAA 

YLTLTKNI IYLFI LSVIVSFL 

ocNYIFINTISYSFIF 

840 860 880 

ATTCTACAGAAAAAACTCTAGTTATGTTCATAGTATCGTTTACTAACCTACTAGTTAAGTTACT 

N am 

EVSFVRTINMTDNVLRSTLNS 

900 920 940 

TGCTGAGTTTTTTAAGTGAAAACTATAAGGTAACTTTATCACTTTATTCCATGT 
ASNKLHF SYFLKIVKNWTVSR 

960 980 1000 

ATGTTTTGTATTATTTTTTGACCTATATATCCAAGAATATAAGTAGAAACACCAGAAAATATTA 
INQIIKQGIYGLIYTSVGSFIL 

1020 1O40 1060 1080 

AAGTCAGACCAAAACATATAATAATGATTACAATTTTATCTGTTGATAAGCTAGATTTGTTTAA 
TLGFCI I IIVIKDTSLSSKNL 

1100 1120 1140 

GGCATTTCTAATTATTAAAGGAATGTATAATGAAAAACTAGTTCCAATCAAACTAAATATTAGT 
ANRIILPIYLSFSTGILSFIL 

1160 H80 1200 

CCAATACTTAAAAGTAGAGTGTTAGGTTTGGTTATTTTCCATAAATCATATAGACCTTTGATAA 
GI SLLLTNPKTIKWLDYLGKII 

S/D 1240 1260 

TATCATCACCTTTTAAACTTTATATCATTAATATAATGTTTAGGAAAAGTAGAAGAAAATTACA 

D D G K L S < •piY 

IR 1280 1300 1320 

CTTTTGT AATTTTCTG AAT AT ACAT AGTACT 

134 0 IR 1360 S/D 1380epiA > 1400 

AGGGTTTTATAATCCTTTTTAATAAATTTTTAGGAGTGTTTAAAATGGAAGCAGTAAAAGAAAA 

M E A V K E K 

1420 1440 1460 

AAATGATCTTTTTAATCTTGATGTTAAAGTTAATGCAAAAGAuATCTAACGATTCAGGAGCTG^ 
NDLFNLDVKVNAK ESNDSGAE 

1480 1500 1 520 

CCAAGAATTGCTAGTAAATTTATATGTACTCCTGGATGTGCAAAAACAGGTAGTTTTAACAGTT 
PRIASKFICTPGCAKTGSFNS 

1540 1560 1580 

ATTGTTGTTAATTCAGAAGAATTAGATTGGCAGGGCTTCAATAGAGGCTCTGTCTTAATT'I^GA 

Y C C oc e P iB > 
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S/D 1600 1620 1640 

GGTGAAATAGAATTGGATAATATATTTGTTCCATCGAATATATATATGGTAAGAACTCCTATAT 
GEIELDNIFVPSNIYMVRTPI 

1660 1680 1700 1720 

TTTCAATTGAATTATATAATCAATTCTTAAAATCTGAC 

FSIELYNQFLKSDNIDYDLILQ 

1740 1760 1780 

AAACGATATTTTTAAAGAATCTATAATGACAACGACATATAATCTTTATCAAAGTATTGGCAAA 
NDIFKESIMTTTYNLYQSIGK 

1800 1820 1840 

ATAGACTGGGAAAAGGATAATAAAAAAACCAGAAATGTAAAAGAAAGTTTATTAAAATATCTCA 
IDWEKDNKKTRNVKESLLKYL 

I860 1880 1900 

TAAGAATGAGTACTAGAAGTACACCATATGGAATGCTAAGCGGTGTAGCTTTAGGGGAATTTAG 
IRMSTRSTPYGMLSGVALGEFS 

1920 1940 1960 

T G AAAAT AAT AAT ATT AAAATT AAGG ACT CTT C G TTT C AT AAAAAAG ATGT AAAAAT AG ATGGG 
ENNNIKIKDSS FHKKDVKIDG 

19 80 2000 2 02 0 ggj O 

CAATGGTTATATAAATTAGTCCATTATTTAGAAAGCGATTACACATATTATAAAGACAGTTTTG 
QWLYKLVHYLESDYTYYKDSF 

2060 2080 2100 

TCATATGGAATCAACAAAATTATATTTATAACAATCGTTTATATTTAGATAATAATTCATCAAT 
VIWNQQN Y IYNNRLYLDNNS SI 

2120 2140 2160 

CACTGAAAATAAAAGAAATGATGTATTATCTGTCAAATACAATTCrrATATTAGTGTTTATACAT 
TENKRNDVLSVKYNSILVFIH 

EcoRI _2180 2200 2220 

GAGAATTCTAAAAAAAATATTACTTATGAAGAACTTGTACAATTGATATCTAGTAAGTACAGTA 

ENSKKN ITYEELVQLISSKYS 

2240 2260 2280 

TAGAAAATAAAGAAGAAGTAAAAGTATTTGTTCAAGAACTCATAAATAAAGAAATTATATTTTC 

I ENKEEVKVFVQ ELI NKEI I FS 

2300 2320 2340 2360 

TGATTTGAGACCTACATTAGAGAATAAAAATCCTTTAGATTACATTATTAATAGTTTAAATCCA 

DLRPTLENKNPLDYI INSLNP 

2380 2400 2420 

AAAAAT AGTTTAGTTGGAACACTT ATT AATAT^TCTAATG AAA TTACAAAA 
KNSLVGTLINI SNEITKYSKM 
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2440 2460 2480 

CTTTAGGAAAAGGAGAATATAAATATTTAGATATTGTTAATTTAATGTCACAATTA 
PLGKGEYKYLDIVNLMSQLFVS 

2500 2520 2540 

TAAAAACTATTTGCAAATAGATACCTATATAGATTATTCAAGAAATGAATTAAAACAAAGTTTA 
KNYLQIDTYIDYSRNELKQSL 

2560 2580 2600 

GCTGATAATATTAGTGAAGCAGCATATATTCTCTGGTTATTATCTCCTAATCATTTTGGTACAA 
ADNISEAAYILWLLSPNHFGT 

2620 2640 2660 2 li£ 

AAACTATTAGGAATTATCACGAATTTTTTATGGATAAATATGGATTTGAACAACTAGTAAATTT 

KTIRNYHEFFMDKYGFEQLVNL 

2700 2720 2 740 

AAAGCAATTGCTCTCAGATATAAATGGATTTGGCTATCCCAAAAAAGACAGTTATAGTTTTTCT 
KQLLSDINGFGYPKKDSYSFS 

2760 2780 2800 

AATAACATTGCATTTTTAAAAGAAAAGTATTTGCTTGCAATTCAAAA 
NNIAFLKEKYLLAIQNNSHIE 

2820 2840 2860 

TAACAGAAAACGACGTTAAAAATTTAGAAAAGAATAATACAGTTTCTAAAATCAATGCGCCTGT 
ITENDVKNLEKNNTVSKINAPV 

2880 2900 2920 

TT CAA CTGAAAT AT AT AGT GAG AT AT ATTTTG GAAATT CAATAAAAGG TTATGAGG ATTTTG C C 
STEIYSEIYFGNSIKGYEDFA 

2940 2960 2980 3000 

GTGATAAGTCCAATATTAGGATCTTTTAATGCCGGTGCAACTTTTGGAAGGTTTACGGGAAATT 
VI SPILGSFNAGATFGRFTGN 

3020 3040 3060 

TCAATATAAAGAAAAAAAATCAATTACAAAAAGAAATAGTGCATCATTACAATAATTACATGAA 
FNIKKKNQLQKEIVHHYNNYMN 

3080 3100 3120 

TGAAAATGGTTTAGAAATAAGCCAATTAAATGAAGGTCCTCTTAACTCAAGAAATGTAAATATT 

ENDLEISQLNEAPLNSRNVNI 

3140 3160 3180 

TTGAATAATAATAGAATATATAATACTTGTTTAAATTTAAATTTACCTAAAAGTGATATAGATA 
LNNNRIYNTCLNLNLPKSDID 

3200 3220 3240 

TAAATGACATATTTATTGGAGCTACATTTAACAAACTTTATCTATATTCTGAAAAACATGATTC 
INDIFIGATFNKLYLYSEKHDS 
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3260 3280 3300 3320 

AAGAATTGTATTCGTATCTAATTCAATGTTTAATTATGAGTTTGGATCTGAATTATACAAATTT 
RIVFVSNSMFNY EFGSELYKF 

3340 3360 3380 

TTAAGAGAAATTTCATTTGAAAAAACAAAATTTATACAACCT 
LREISFEKTKFIQPITEEG ID 

3400 3420 3440 

CATTACCTTTTTGTCCAAGAATTATTTATAAAAATATTATTTTA 

SLPFCPRIIYKNIILKPATWKI 

3460 3480 3500 

AAATTCAGAAATGTTTTCTGAAACTGAAAATTGGTTAAATAGGTTCGCAACTATTAGAGAAAAA 
NSEMFSETENWLNRFATIREK 

3520 3540 3560 

TGGCATATTCCAAAAGATGTAATTATTGCTTTTGGAGATAATCGATTGCTATTAAATTTATTAA 
WHIPKDVI IAFGDNRLLLNLL 

3580 3600 3620 3640 

ATGACAAGCATCTCATTATACTAAAAAAAGAACTAAAAAAACATGGTAGGATTCGAATATTAGA 

NDKHLII LKKELKKHGR I R I LE 

Hindll l 3 660 3 68 0 3 7 00 

AAGCTTTATCAATGAATCTAATAATGAGAGAATGTTAGAAATTGTTACGCCATTATATAAAAAA 
SFINESNNERMLEIVTPLYKK 

3720 3740 3760 

ACTAGTTTAAAAGAACAATCTTTCATTATACCTAAAAATAGAAATAAGCACTTCAATAATCTTA 
TSLKEQSFIIPKNRNKHFNNL 

3780 3800 3820 

AAG ATT GG TTTT C AATT C ATTT AAG T ATT C CT AAAA C AT A C C AAG AT AATTTT ATT CAAG ATT A 
KDWFSIHLS IPKTYQDNF I QDY 

3840 3860 3880 

TCTATTACCATTTATAACGGAATTAAAAGTTAATAATTTTATTA^ 

LLPFITELKVNNFINKFFYIK 

3900 3920 3940 3960 

TTTAAAGAAGATGAAGATTTTATAAAATTAAGATTATTAAGAGAAGATGAAGATTATTCTCAAA 
FKEDEDFIKLRLLREDEDYSQ 

3980 4000 4020 

TTTATTCTTTCATAAAAAATTGGAAAGATTATTGCTTATTAAA 

IYSFIKNWKDYCLLNSELYDYS 

4040 4060 4080 

TATAGTTGATTATGTTCCTGAAGTATATAGATATGGTGGTCCACACGTAATTGAAGATATTGAG 
IVDYV'PEVYRYGGPHVI EDIE 
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4100 4120 4140 

AATTTTTTTATGTATGATAGTCTATTATCAATAAATATAATACAATCAGAGTTCAAAATTCCAA 
NFFMYDSLLS INIIQSEFKIP 

4160 4180 4200 

AAGAATTTATCGTTGCTATATCAATAGATTTTTTATTAGATTATTTAGAAATTAATAAAAGTGA 
KEFIVAISIDFLLDYLEINKSE 

4220 4240 4260 4280 

GAAAGAAGAAATTTTAATTAATAATGCGGAAGATTTATATCGTAGTAATGACATAAGAGAATAT 
KEEILINNAEDLYRSNDIREY 

4300 4320 4340 

AAAAATTT ATT AG CT AAACTT A C C AAT C CT AAAAATG ACT AT G AAATTTT AAAAAAAG AATTT C 
KNLLAKLTHPKNDYEI LKKEF 

4360 4380 4400 

CGAATCTTCATG AATTT CTATTT AAT AAAATTAGTATTTTAGAAAATCTI^ 

PNLH E FLFNK I S I LEN LKKTLQ 

Hindll l s^B_ «pic > 4460 

AAAAAGCTTATATACTTCACGTTCTAGGATAATTGGCAGTTTTATACACATGCGTTGTAATAGA 

KSLYTSRSRI I G SFIHMRCNR 

Pl] AVLYTCVVIE 

4480 4500 4520 

ATATTCGGTATTAATCCTGAAAAAGAAAAATTTGTTTTATCTATTTTTAATGAAAT^ 
IFGINPEKEKFVLSIFNEITK 
YSVLILKKKNLFYLFLMKLQK 

4540 4560 4580 4600 

CT AAAAAAT ATTGG G ATGG TTGT G ATT AAT ATT AAT AA C ATT AAAAAAATTTT AG AAAAT AAAA 
TKKY WDGCD oc ococ 
LKNIGMVVININNIKK ILENK 

4620 4640 4660 

TCACCTTTTTTGTCTGACATTGAAAAAGC^ACATATATTATAGAAAATCAAAGTGAGTATTGGGA 
ITFLSDI EKATYI IENQSEYWD 

4680 4700 4720 

TCCTTATACTCTATCTCATGGTTATCCAGGTATAATACITTTTTTAAGCGCATCAGAAAAAGTA 
PYTLSHGYPG IILFLSASEKV 

4740 4760 4780 

TTTCATAAAGATTTAGAAAAAGTAATACATCAATATATTAGAAAACTAGGCCCITATTTAGAAA 
FHKDLEKVIHQYIRKLGPYLE 

4800 4820 4840 

GTGGTATTGATGGATTTTCACTTTTTAGTGGTCTTTCCGGAATTGGATTTGCGCTA^ 
SGIDGFSLFSGLSGIG FALDIA 
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FIG. 9 (7/1D 

4860 4880 4900 4920 

GTCTGATAAACAGTACTCTTATCAAAGTATCTTAGAACAAATTGATAATTTACTTGTTCAATAT 
SDKQYSYQSILEQIDNLLVQY 

4940 4960 4980 

GTTTTTGATTTTTTAAATAACGATGCATTGGAAGTAACCCCTACTAACTATGATATAATACAAG 
VFDFLNNDALEVTPTNYDIIQ 

5000 5020 5040 

GATTTTCTGGTATAGGAAGGTACTTGTTAAATAGAATATCGTATAATTATAATGCAAAAAAAGC 
G FSGVGRY LLNRI S YNYNAKKA 

5060 5080 5100 

ATTAAAGCATATACraAATTACTTCAAAACAATTCATTACT 

LKHILNY FKTIHY SKDNWLVS 

5120 5140 5160 

AATGAACATCAATTTTTAGATATAGATAAGCAAAATTTTCCGTCAGGAAATATAAATTTAGGAT 
NEHQFLDIDKQNFPSGNINLG 

5180 5200 5220 5240 

TAGCGCATGGTATTTTAGGTCCTCTATCATTAACAGCTTTGAGTAAAATGAATGGGATTGAAAT 
L.AHGI LGP LSLTAL SKMNGIEI 

5260 5280 BcoRI 

CGAAGGCCATGAAGAGTTTTTACAAGACTTCACTTCATTTTTGCTCAAACCTGAATTCAAAAAT 
EGHEEFLQDFTSFLLKPEFKN 

5320 5340 5360 

AATAATGAATGGTTCGATCGCrrATGATATATTAGAAAATTATATACCTAATTATTCCGTCAGAA 
NNEWFDRYDILENY IPNYSVR 

5380 5400 5420 

ACGGTTGGTGTTACGGTGATACAGGGATTATGAATACATTACTTTTGTCTGGTAAAGCCTTAAA 

NGWCYGDTG IMNTLLLSGKALN 

5440 5460 5480 

TAATGAAGGCTTAATTAAAATGTCTAAAAATATTTTAATTAACATAATAGATAAGAATAATGAT 

NEGLIKMSKNILI NI IDKNND 

5500 5520 5540 5560 

GATTTAATCAGTCCAACCTTCTGTCACGGACTAGCATCGCACTTAACCATTATTCATCAAGCGA 
DLISPTFCHGLASHLTI IHQA 

5580 5600 5620 

ATAAATTCTTTAATCTATCTCAAGTAAGCACATATATCGATACCATTGTCAGAAAAATTATTAG 
NKFFNLSQVSTYIDTIVRKIIS 

5640 5660 5680 

TCATTATTCTGAAGAAAGTAGTTTTATGTTCCAAGACATAGAGTACTCATACGGACAAAAAATT 

hysee'ssfmfqdi EYSYGQKI 
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5700 ECORI 5720 5740 

TATAAAAACAAAGTGGGAATTCTAGAGGGTGAATTAGGTGTTCTTTTAGCTTTACTAGATTATA 
YKNKVGILEGELGVLLALLDY 

5760 5780 5800 S/D 



TTGATACACAAAACCAATCAAGGAAAAATTGGAAAAATATGTTTTTAATAACATAATAGGAGGA 
IDTQN QSRKNWKNMFLTT oc 

5820 mplD > 5840 5860 5880 

ATAAGATATGTATGGAAAATTATTGATATGCGCTACAGCATCGATAAATGTAATTAATATTAAT 
MY GKKLLCATAS I N V I N I N 

5900 5920 5940 

CACTACATAGTTGAGTTAAAGGAAGATTTTGATGAAGTTAATATATTATTTAGTCCTAGTAG 
HYIVELKQHFDEVNILFSPSS 

5960 5980 6000 

AAAATTTTATAAATACTGATGTTCTCAAGTTATTTTGTGATAACTTG 

KNFI NTDVLKLFCDNLYDEIKD 

6020 6040 6060 

T C CT CTTTT AAAT C AT AT C AAT ATTGT AG AAAAT C ATG AAT AT ATTTT AGT ATT AC CTG C ATCA 
PLLNH INIVENHEYILVLPAS 

6080 6100 6120 

GCAAATACTATTAATAAAATAGCTAATGGTATATGTGATAATCTITTAACrACTGTATGTTTAA 
ANTI NKIANGICDNLLTTVCL 

EcoRV _ 6160 6180 6200 

CCGGATATCAAAAATTATTTATATTTCCAAATATGAACATAAGAATGTGGGGAAATCCATTTTT 
TGYQKLFI FPNMNIRMWGNPFL 

6220 6240 6260 

ACAAAAAAATATTGATTTACTTAAAAATAATGATGTGAAAGTGTATTCCCCTGATATGAATAAA 
QKN I DLLKNNDVKVYSPDMNK 

6280 6300 6320 

TCATTCGAAATATCTAGTGGCCGTTACAAAAACAATATCACAATGCCTAATATTGAAAATGTAC 
SFEI SSGRYKNNITMPNI ENV 

634 0 6360 Te rm i n a tor 6380 

TAAATTTTGTATTAAATAACGAAAAAAGACCTTTGGATTAACAAAGGTCTTTTCTAAT^ 
LNFVLNNEKRPLDoc 

OCCLDKRILI 

64 00 6400 644 0 

TTTATATCCGAGTTTACGTTCATTAATAATTTCTATCTC 
KYGLKRENIIEI EKCNKLSDR 

6460 6480 6500 6520 

AATCGATGGATATATACATTTATTGTATTAGAATCAACAAAGTCITCTGTATCCCACACTCCCT 

LRHI YVNITNSDVFDETDWVGK 
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6540 6560 6580 

TTTTTAATTCCTCTTTTGATACATATCTC 

KLEEKSVYRGLN IYLCRLIKL 

6600 6620 6640 

TTCTATATTAGAAAGATTAACTAAGTAATTATTAAACACAAATTGATGGTTTTCAAA 
EINSLNVLYNNFVFQHNEFDI 

6660 6680 6700 

AAATCATCATTAACATATTTAATATACTTTTTTATTTCATTTAAAATTCT 

FDDNVYKI YKKI ENLIRCLILS 

6720 6740 6760 

TTTTGCTTTCATTATTTTTTATAATATATAAATCT 

KSENNKI IYLD IGLSYFYCKR 

6780 6800 6820 6840 

ACTATAGCTAGTATTACCTGTTATTATAACTATTGGAATTTT^ 
SYSTNGTIIVIPIKGIFEKLF 

6860 6880 6900 

GT AT AAT A CT C A T C AAA CTTTT CAT AC AC AATT AT AAAATTTGG GT CT AT ATTTG AAG AATT AA 
TYYEDFKEYVI I FNPDINSSNI 

6920 6940 6960 

TTGT AATT CTT CT AT CT AATT CT AAAAT ACTTTC AAT AAG AAT AG AATCT AC CTCAC C G AC AAT 
TIRRDLELISEILISDVEGVI 

6980 S/D 7000 7020 

ATTAATAGAAATCATTTTATTCCCTTCATTCTTTAAGT 
NISI < «piQ op EKLLKYVDLKGN 

704a 7060 7080 

ACCATAATGTTTTTTATCCATATATTTTTCTTTTTCTATCC 
GYHKKDMYKEKEIGKKKFLEI 

7100 7120 7140 7160 

GCTGTTTCGGGTTGGTCTTTTAATTGATACTTATCAATTTCTAGTGCTAAAGCT 
ATEPQDKLQYKD I ELALAGSVK 

7180 7200 7220 

TGGGTGCAGCAAGTGATGTCCCrrGATTGATATATGTATCTTCCATTAGAAGAAGTACTTAAAAC 
PAALSTGSQY I YRGNSSTSLV 

7240 7260 7280 

ACTTTGTTTTTGCATATATCCTTTTTCTAACCAAGCATCTTTTCCATACTTATCT 
SQKQMYGKELWADKGYKDLLK 

7300 7320 7340 

TAAGAACCTCCTATCGTCATTAAATCTATAAAATTATTTCCATAATTAGAAAACTCAGAAATAT 

YSGGITMLDIFNNGYNSFESIY 
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7360 £ABSI_ 7380 7400 

AATCATTATCATCGATGGATCCTACAGTCATAACATTATTTAGATTTGCTGGGCTATCATATAC 
D NDDI SGVTMVNNLNAPSDYV 

7420 7440 7460 7480 

CTTTTTTGATGTTTTAGAATTTAGATl w rCTT^ 
KKSTKSNLNRKKNIE K~V K K V N 

7500 7520 7540 

ATACCGTCATTACCCACAGCTGCAACAACAATACTACC , riUU M l"rT , rGAGCATAGTTTATAGCTT 
IGDNGVAAVVISGKKQAYNIAK 

7560 7580 7600 

TCTGTAGTGCATCGTAATCAACrrrTTCATCAT 

QLADYDVKEDDRLKKKNQNDK 

7620 7640 7660 

AATTAAATAATTTCCTAAACTAACGTTGATTACATCATTGTCATCATTTGCTG 
ILYNGLSVNIVDNDDNAADII 

7680 7700 7720 

CCTTTAGATACCCAAAGCATTTCACITTTCTTTGAGCCAAATACTCGGTATACATTCATCTCTA 
GKSVWLMESKKSGFVRYVNMEV 

7740 7760 7780 7800 

CTCCAGGGTTTACACCTTTTAAATTACCGTTTG 

GPNVGKLNGNAGIQGAVLTGH 

7820 7840 7860 

ATTCAATTTATCTITCTTCAAAATTTTTATTTCCT 
NLKDEEFNKNGSEDNESGRFG 

7880 7900 7920 

TTTTTAGGCACTTCATTAACTATCTTATTTATACT 

NKPVENVIKNISKLDTHSSNVG 

7940 7960 7980 

CAGAATCTACTAAAGCAACTTTTGCTTTTTTTCTA 

SDVLAVKAKKRDPSLKYSKGE 

8000 8020 8040 

ATTTGTTATTTTTCGCATATCCCATTGTCTGTCAAATAAATCA 
NT I KRMDWQRDFLDHSGNKKN 

8060 8080 8100 8120 

TTTAAATTTTTTCCTGTCTTTACAGATTTTTCAACTACACAAGTGGAACA 

NLNKGTKVSKEVVCTS CTPNVF 

. 8140 8160 8180 

acttgacgtttttattactctttattagtgaatttaattttgatot 

kvnkn"skilsnlkskstkiqa 
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8200 Hindll 8220 8240 

TGTATGTAGTTCAGGAATTTTATAAGTTAACTCGATAT T TTTTTGTTTAATGGATTCTTTAAAA 
THLEPIKYTLEINKQKI SEKF 

8260 8280 830O 

GTTTTTGCATTATCATATTCAACACTATAATAACTTAATTCTTCATT^ 

TKANDYEVSYYSLEENXSSGFA 

8320 8340 8360 

CATACTCATTTTGCAAAAAAACTAATGACAATATTAAAAAAACAATGAAAAATTTAAA 

YENQLFVLSLILFVI FFKFKM 

8380 S/D 8400 8420 8440 

C AT AT AG CAC CT^T AA CAT ATT ATTTATATT AAACATTAATTTAACACTT ATGTTTTT ACTTTT 

M < «piP 

8460 8480 8500 

TTATTTATATTATCTTTAATAATGTTCTGTTGCAAGATGAAAAATACGAGGTATCAAAGTACCG 

8520 8540 85 60 

ATACAGCGAGTATTACACTCAATTAATTAAAAATAAAATATGTTGTGATTAAAATTTATTTTAT 

8 580 8 600 Kpal l 

AAAAGTATGGGCAATTTATTATTATTCAAGTTAAAACAAAGAGTCCGGGACATAAAGTTTCAGC 

8640 8660 8680 

CTCTTCGTCCTAATTACCAAAAAACTTACTCCAAAATCCTTTTT^ 

8700 
TTTT 
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